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PREFACE TO THE SECOND EDITION 


THE first edition of this volume of the Course of Theoretical Physics was published 
in two parts (1971 and 1974) under the title ‘Relativistic Quantum Theory’’. It 
contained not only the basic material on quantum electrodynamics but also chap- 
ters on weak interactions and certain topics in the theory of strong interactions. 
The inclusion of those chapters now seems to us inopportune. The theory of strong 
and weak interactions is undergoing a vigorous development founded on new 
physical ideas, and the situation in this field is changing very rapidly, so that the 
time for a consistent exposition of the theory has not yet arrived. In the present 
edition, therefore, we have retained only quantum electrodynamics, and accord- 
ingly changed the title of the volume. 

As well as a considerable number of corrections and minor changes, we have 
made in this edition several more significant additions, including the operator 
method of calculating the bremsstrahlung cross-section, the calculation of the 
probabilities of photon-induced pair production and photon decay in a magnetic 
field, the asymptotic form of the scattering amplitudes at high energies, inelastic 
scattering of electrons by hadrons, and the transformation of electron—positron 
pairs into hadrons. 

A word regarding notation. We have reverted to the use of circumflexed letters 
for operators, in line with the other volumes in the Course. No special notation is 
used for the product of a 4-vector and a matrix vector y", previously denoted by a 
circumflexed letter; such products are now shown explicitly. 

We have, alas, had to prepare this edition without the aid of Vladimir Berestets- 
kii, who died in 1977; but some of the added material mentioned above had been 
put together previously, by the three authors jointly. 

Our sincere thanks are offered to all readers who have given us their comments 
on the first edition of the book, and in particular to J. S. Bell, V. P. Krainov, L. B. 
Okun’, V. I. Ritus, M. I. Ryazanov and I. S. Shapiro. 


July 1979 E. M. LIFSHITZ 
L. P. PITAEvskii 


FROM THE PREFACE TO 
THE FIRST EDITION 


IN ACCORDANCE with the general plan of this Course of Theoretical Physics, the 
present volume deals with relativistic quantum theory in the broad sense: the 
theory of all phenomena which depend upon the finite velocity of light, including 
the whole of the theory of radiation. 

This branch of theoretical physics is still far from completion, even as regards 
its basic physical principles, and this is particularly true of the theory of strong and 
weak interactions. But even quantum electrodynamics, despite the remarkable 
achievements of the last twenty years, still lacks a satisfactory logical structure. 

In the choice of material for this book we have considered only results which 
appear to be reasonably firmly established. In consequence, of course, the greater 
part of the book is devoted to quantum electrodynamics. We have tried to give a 
realistic exposition, with emphasis on the physical hypotheses used in the theory, 
but without going into details of justifications, which in the present state of the 
theory are in any case purely formal. 

In the discussion of specific applications of the theory, our aim has been not to 
include the whole vast range of effects but to select only the most fundamental of 
them, adding some references to original papers which contain more detailed 
studies. We have often omitted some of the intermediate steps in the calculations, 
which in this subject are usually very lengthy, but we have always sought to 
indicate any non-trivial point of technique. 

The discussion in this book demands a higher degree of previous knowledge on 
the part of the reader than do the other volumes in the Course. Our assumption has 
been that a reader whose study of theoretical physics has extended as far as the 
quantum theory of fields has no further need of predigested material. 

This book has been written without the direct assistance of our teacher, L. D. 
Landau. Yet we have striven to be guided by the spirit and the approach to 
theoretical physics which characterized his teaching of us and which he embodied 
in the other volumes. We have often asked ourselves what would be the attitude of 
Dau to this or that topic, and sought the answer prompted by our many years’ 
association with him. 

Our thanks are due to V. N. Baier, who gave great help in compiling $890 and 
97, and to V. I. Ritus for great help in writing §101. We are grateful to B. E. 
Meierovich for assistance with calculations, and also to A. S. Kompaneets, who 
made available his notes of L. D. Landau’s lectures on quantum electrodynamics, 
given at Moscow State University in the academic year 1959-60. 


June 1967 V. B. BERESTETSKIi, E. M. Lirsuirz, L. P. PITAEVSkKti 
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NOTATION 


Four-dimensional 


Four-dimensional tensor indices are denoted by Greek letters A, wp, v,..., taking 
the values 0, 1, 2, 3. 
A 4-metric with signature (+ — — —) is used. The metric tensor is 


2.v( Zoo = 1, 21 = 22 = 933 = —1). 


Components of a 4-vector are stated in the form a” = (a’°, a). 

To simplify the formulae, the index is often omitted in writing the components 
of a 4-vector.t The scalar products of 4-vectors are written simply as (ab) or ab; 
ab =a,b" = agby—a-b. 

The 4-position-vector is x“=(t, r). The 4-volume element is d*x. 

The operator of differentiation with respect to the 4-coordinates is 0, = d/dx". 

The antisymmetric unit 4-tensor is e*’”’, with e°!7? = —e@,.3= 41. 

The four-dimensional delta function 5(a)=6(do)8(a). 


Three-dimensional 


Three-dimensional tensor indices are denoted by Latin letters i, k,1,..., taking 
the values x, y, z. 

Three-dimensional vectors are denoted by letters in bold type. 

The three-dimensional volume element is d?x. 


Operators 


Operators are denoted by italic letters with circumflex.+ : : 
Commutators or anticommutators of two operators are written {f, o}. = fé + Sf. 
The transposed operator is f. 

The Hermitian conjugate operator is f*. 


Matrix elements 


The matrix element of the operator F for a transition from initial state i to final 
state f is Fj or (f|Fli). 


+ This way of writing the components is often used in recent literature. It is a compromise between 
the limited resources of the alphabet and the demands of physics, and means, of course, that the reader 
must be particularly attentive. 

t However, to simplify the formulae, the circumflex is not written over spin matrices, and it is also 
omitted when operators are shown in matrix elements. 


Xill 


X1V Notation 


The notation |i) is used as an abstract symbol for a state independently of any 
specific representation in which its wave function may be expressed. The notation 
(f| denotes a final (““complex conjugate’) state.} 

Correspondingly, (s|r) denotes the coefficients in the expression of a set of 
states with quantum numbers r as superpositions of states with quantum numbers 


s: |r)= 2 [s)<s{r). 


The reduced matrix elements of spherical tensors are (f||Flji). 


Dirac’s equation 


The Dirac matrices are y“, with (y° = 1, (y'! =(y’° = (7° = -1. The matrix 
a= yy, B =v’. The expressions in the spinor and standard representations are 
(21.3), (21.16) and (21.20). 

y=—iy’y'y’y’, Oy = 1; see (22.18). 

ot =2(y"y" — y’y#); see (28.2). 

Dirac conjugation is expressed by = *y°. 

The Pauli matrices are o = (o,, oy, a), defined in §20. 

The 4-spinor indices are a, B,... and a, B, ..., taking the values 1, 2 and 1, 2. 

The bispinor indices are i,k, /,..., taking the values 1, 2, 3, 4. 


Fourier expansion 


Three-dimensional: 


fi)=f fae", fay = | fey eds, 


and similarly for the four-dimensional expansion. 


Units 


Except where otherwise specified, relativistic units are used, with h = 1, c = 1. 
In these units, the square of the unit charge is e” = 1/137. 

Atomic units have e=1, h = 1, m = 1. In these units, c = 137. The atomic units 
of length, time and energy are h*/me’, h?/me* and me*/h’; the quantity Ry = 
me‘/2h° is called a rydberg. 

Ordinary units are given in the absolute (Gaussian) system. 


Constants 


Velocity of light c = 2.998 x 10" cm/sec. 
Unit charget |e|=4.803x 10" CGS electrostatic units. 


+ This notation is due to Dirac. 
t Throughout the book (except in Chapter XIV), e denotes the charge with the appropriate sign, so 
that e = —|e| for an electron. 


Notation XV 


Electron mass m = 9.11 108g. 


Planck’s constant h = 1.055 x 10°”’ erg. sec. 

Fine-structure constant a = e7/hc; 1/a = 137.04. 

Bohr radius h?/me? = 5.292 x 10°? cm. 

Classical electron radius r, = e*/mc? = 2.818 x 10°° cm. 
Compton wavelength of the electron fi/mc = 3.862 x 10°'' cm. 
Electron rest energy mc’ = 0.511 x 10° eV. 

Atomic energy unit me*/h? = 4.360 x 10°"' erg = 27.21 eV. 
Bohr magneton |e|fi/2mc = 9.274 x 10°" erg/G. 

Proton mass m, = 1.673 x 10°“ g. 

Compton wavelength of the proton f/m,c = 2.103 x 10°'* cm. 
Nuclear magneton |e|4/2m,c = 5.051 x 10°“ erg/G. 

Mass ratio of muon and electron m,/m = 2.068 x 10°. 


References to volumes in the Course of Theoretical Physics: 


Mechanics = Vol. 1 (Mechanics, third English edition, 1976). 

Fields = Vol. 2 (The Classical Theory of Fields, fourth English edition, 1975). 

QM or Quantum Mechanics = Vol. 3 (Quantum Mechanics, third English 
edition, 1977). 

ECM = Vol. 8 (Electrodynamics of Continuous Media, English edition, 1960). 

PK = Vol. 10 (Physical Kinetics, English edition, 1981). 


All are published by Pergamon Press. 


INTRODUCTION 


$1. The uncertainty principle in the relativistic case 


THE quantum theory described in Volume 3 (Quantum Mechanics) is essentially 
non-relativistic throughout, and is not applicable to phenomena involving motion at 
velocities comparable with that of light. At first sight, one might expect that the 
change to a relativistic theory is possible by a fairly direct generalization of the 
formalism of non-relativistic quantum mechanics. But further consideration shows 
that a logically complete relativistic theory cannot be constructed without invoking 
new physical principles. 

Let us recall some of the physical concepts forming the basis of non-relativistic 
quantum mechanics (QM, $1). We saw that one fundamental concept is that of 
measurement, by which is meant the process of interaction between a quantum 
system and a classical obiect or apparatus, causing the quantum system to acquire 
definite values of some particular dynamical variables (coordinates, velocities, 
etc.). We saw also that quantum mechanics greatly restricts the possibility that an 
electron{ simultaneously possesses values of different dynamical variables. For 
example, the uncertainties Ag and Ap in simultaneously existing values of the 
coordinate and the momentum are related by the expressiont AgAp ~h; the 
greater the accuracy with which one of these quantities is measured, the less the 
accuracy with which the other can be measured at the same time. 

It is important to note, however, that any of the dynamical variables of the 
electron can individually be measured with arbitrarily high accuracy, and in an 
arbitrarily short period of time. This fact is of fundamental importance throughout 
non-relativistic quantum mechanics. It is the only justification for using the concept 
of the wave function, which is a basic part of the formalism. The physical 
significance of the wave function w(q) is that the square of its modulus gives the 
probability of finding a particular value of the electron coordinate as the result of a 
measurement made at a given instant. The concept of such a probability clearly 
requires that the coordinate can in principle be measured with any specified 
accuracy and rapidity, since otherwise this concept would be purposeless and 
devoid of physical significance. 

The existence of a limiting velocity (the velocity of light, denoted by c) leads to 
new fundamental limitations on the possible measurements of various physical 
quantities (L. D. Landau and R. E. Peierls, 1930). 


+ As in QM, §1, we shall, for brevity, speak of an ‘“‘electron’’, meaning any quantum system. 
t In this section, ordinary units are used. 
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In QM, 844, the following relationship has been derived: 
(v'—v)ApAt ~ fh, (1.1) 


relating the uncertainty Ap in the measurement of the electron momentum and the 
duration At of the measurement process itself; v and v’ are the velocities of the 
electron before and after the measurement. From this relationship it follows that a 
momentum measurement of high accuracy made during a short time (i.e. with Ap 
and At both small) can occur only if there is a large change in the velocity as a 
result of the measurement process itself. In the non-relativistic theory, this showed 
that the measurement of momentum cannot be repeated at short intervals of time, 
but it did not at all diminish the possibility, in principle, of making a single 
measurement of the momentum with arbitrarily high accuracy, since the difference 
v'—v could take any value, no matter how large. 

The existence of a limiting velocity, however, radically alters the situation. The 
difference v’ — v, like the velocities themselves, cannot now exceed c (or rather 2c). 
Replacing v’— v in (1.1) by c, we obtain 


ApAt ~ hic, (1.2) 


which determines the highest accuracy theoretically attainable when the momen- 
tum is measured by a process occupying a given time At. In the relativistic theory, 
therefore, it is in principle impossible to make an arbitrarily accurate and rapid 
measurement of the momentum. An exact measurement (Ap > 0) is possible only in 
the limit as the duration of the measurement tends to infinity. 

There is reason to suppose that the concept of measurability of the electron 
coordinate itself must also undergo modification. In the mathematical formalism of 
the theory, this situation is shown by the fact that an accurate measurement of the 
coordinate is incompatible with the assertion that the energy of a free particle is 
positive. It will be seen later that the complete set of eigenfunctions of the 
relativistic wave equation of a free particle includes, as well as solutions having the 
“correct” time dependence, also solutions having a “negative frequency’. These 
functions will in general appear in the expansion of the wave packet corresponding 
to an electron localized in a small region of space. 

It will be shown that the wave functions having a “negative frequency” 
correspond to the existence of antiparticles (positrons). The appearance of these 
functions in the expansion of the wave packet expresses the (in general) inevitable 
production of electron—positron pairs in the process of measuring the coordinates 
of an electron. This formation of new particles in a way which cannot be detected 
by the process itself renders meaningless the measurement of the electron coor- 
dinates. 

In the rest frame of the electron, the least possible error in the measurement of 
its coordinates 1s 


Aq ~ hime. (1.3) 


This value (which purely dimensional arguments show to be the only possible one) 
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corresponds to a momentum uncertainty Ap ~ mc, which in turn corresponds to the 
threshold energy for pair production. 

In a frame of reference in which the electron is moving with energy e, (1.3) 
becomes 


Aq ~ ch/e. (1.4) 


In particular, in the limiting ultra-relativistic case the energy is related to the 
momentum by « ~ cp, and 


Aq ~ hip, (1.5) 


i.e. the error Ag is the same as the de Broglie wavelength of the particle. 

For photons, the ultra-relativistic case always applies, and the expression (1.5) 
is therefore valid. This means that the coordinates of a photon are meaningful only 
in cases where the characteristic dimensions of the problem are large in com- 
parison with the wavelength. This is just the “‘classical’’ limit, corresponding to 
geometrical optics, in which radiation can be said to be propagated along definite 
paths or rays. In the quantum case, however, where the wavelength cannot be 
regarded as small, the concept of coordinates of the photon has no meaning. We 
shall see later ($4) that, in the mathematical formalism of the theory, the fact that 
the photon coordinates cannot be measured is evident because the photon wave 
function cannot be used to construct a quantity which might serve as a probability 
density satisfying the necessary conditions of relativistic invariance. 

The foregoing discussion suggests that the theory will not consider the time 
dependence of particle interaction processes. It will show that in these processes 
there are no characteristics precisely definable (even within the usual limitations of 
quantum mechanics); the description of such a process as occurring in the course 
of time is therefore just as unreal as the classical paths are in non-relativistic 
quantum mechanics. The only observable quantities are the properties (momenta, 
polarizations) of free particles: the initial particles which come into interaction, and 
the final particles which result from the process (L. D. Landau and R. E. Peierls, 
1930). 

A typical problem as formulated in relativistic quantum theory is to determine 
the probability amplitudes of transitions between specified initial and final states 
(t > +0) of a system of particles. The set of such amplitudes between all possible 
states constitutes the scattering matrix or S-matrix. This matrix will embody all the 
information about particle interaction processes that has an observable physical 
meaning (W. Heisenberg, 1938). 

There is as yet no logically consistent and complete relativistic quantum theory. 
We shall see that the existing theory introduces new physical features into the 
nature of the description of particle states, which acquires some of the features of 


+ The measurements in question are those for which any experimental result yields a conclusion 
about the state of the electron; that is, we are not considering coordinate measurements by means of 
collisions, when the result does not occur with probability unity during the time of observation. 
Although the deflection of a measuring-particle in such cases may indicate the position of an electron, 
the absence of a deflection tells us nothing. 


4 Introduction §1 


field theory (see $10). The theory is, however, largely constructed on the pattern of 
ordinary quantum mechanics. This structure of the theory has yielded good results 
in quantum electrodynamics. The lack of complete logical consistency in this 
theory is shown by the occurrence of divergent expressions when the mathematical 
formalism is directly applied, although there are quite well-defined ways of eli- 
minating these divergences. Nevertheless, such methods remain, to a considerable 
extent, semiempirical rules, and our confidence in the correctness of the results is 
ultimately based only on their excellent agreement with experiment, not on the 
internal consistency or logical ordering of the fundamental principles of the theory. 


CHAPTER I 


PHOTONS 


§2. Quantization of the free electromagnetic field 


WITH the purpose of treating the electromagnetic field as a quantum object, it is 
convenient to begin from a classical description of the field in which it is 
represented by an infinite but discrete set of variables. This description permits the 
immediate application of the customary formalism of quantum mechanics. The 
representation of the field by means of potentials specified at every point in space 
is essentially a description by means of a continuous set of variables. - 

Let A(r, t) be the vector potential of the free electromagnetic field, which 
satisfies the “‘transversality condition” 


div A= 0. (2.1) 
The scalar potential ® = 0, and the fields E and H are 
E=—A, H = curl A. (2.2) 
Maxwell’s equations reduce to the wave equation for A: 
AA — 0°A/at? = 0. (2.3) 
In classical electrodynamics (see Fields, $52) the change to the description by 
means of a discrete set of variables is brought about by considering the field in a 
large but finite volume V.t The following 1s a brief résumé of the argument. 


The field in a finite volume can be expanded in terms of travelling plane waves, 
and its potential is then represented by a series 


A= >) (a, e* T+aé ee"), (2.4) 
k 


where the coefficients a, are functions of the time such that 
a~e™, w=\|kl. (2.5) 


The condition (2.1) shows that the complex vectors a, are orthogonal to the 
corresponding wave vectors: a, °-k= 0. 
The summation in (2.4) is taken over an infinite discrete set of values of the 


~ We shall take V = 1, in order to reduce the number of factors in the formulae. 
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wave vector (i.e. of its components k,, ky, k,). The change to an integral over a 
continuous distribution may be made by means of the expression d°*k/(27r)’ for the 
number of possible values of k belonging to the volume element d*k = dk,dk,dk, in 
k-space. 

If the vectors a, are specified, the field in the volume considered is completely 
determined. Thus these quantities may be regarded as a discrete set of classical 
‘‘field variables’’. In order to explain the transition to the quantum theory, however, 
a further transformation of these variables is needed, whereby the field equations 
take a form analogous to the canonical equations (Hamilton’s equations) of 
classical mechanics. The canonical field variables are defined by 


I ” 
Qx = (ay ae * ab), ve 


| a Tay ~ayt)= Q,., 


and are evidently real. The vector potential is expressed in terms of the canonical 
variables by 


A=V (40) yy, (2, cosk:-r— + P,, sink - r). (2:7) 
k 
To find the Hamiltonian H, we must calculate the total energy of the field, 
g, | +H ax 
8 ar ‘ 


and express it in terms of the Q, and P,. When A is written as the expansion (2.7), 
and E and H are found from (2.2), the result of the integration is 


H =3¥ i+ 07Q). 
k 


Each of the vectors P, and Q, 1s perpendicular to the wave vector k, and 
therefore has two independent components. The direction of these vectors deter- 
mines the direction of polarization of the corresponding wave. Denoting the two 
components of the vectors Q, and P, (in the plane perpendicular to k) by Qka, Pra 
(a = 1, 2), we can write the Hamiltonian as 


H = 3) (Pia + @ Qia). (2.8) 
k,a 


Thus the Hamiltonian is the sum of independent terms, each of which contains 
only one pair of quantities Q,., Pye. Each such term corresponds to a travelling 
wave with a definite wave vector and polarization, and has the form of the 
Hamiltonian for a one-dimensional harmonic oscillator. This expansion is therefore 
often referred to as an oscillator expansion of the field. 
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Let us now consider the quantization of the free electromagnetic field. The 
classical description of the field given above makes the manner of transition to the 
quantum theory obvious. We have now to use canonical variables (generalized 
coordinates Q,, and generalized momenta P,,,) as operators, with the commutation 
rule 


POs _ OQkaP ra = =i; (2.9) 


operators with different values of k and a always commute. The potential A and, 
according to (2.2), the fields E and H likewise become (Hermitian) operators. 
The consistent determination of the Hamiltonian requires the calculation of the 
integral 
Ha | (F2 + AP) dx, (2.10) 


T 


in which E and H are expressed in terms of P,,. and Oka: However, the fact that the 
latter do not commute is actually unimportant, since the products OkoP io appear 
multiplied by cosk:rsink:r, which becomes zero on integration over the whole 
volume. The resulting expression for the Hamiltonian is therefore 


H => (Pi. + w’Qia), (2.11) 
k, a 


which is, as we might have expected, exactly the same in form as the classical 
Hamiltonian. 

The determination of the eigenvalues of this Hamiltonian involves no further 
calculation, since it is equivalent to the familiar problem of the energy levels of 
linear oscillators (QM, §23). We can therefore immediately write down the field 
energy levels: 


E => (Na +30, (2.12) 
k, a 


where the N,. are integers. 

The further discussion of this formula will be left until §3; here we shall write 
out the matrix elements of the quantities Q,,., which can be done at once by means 
of the known formulae for the matrix elements of the coordinates of an oscillator 
(see QM, §23). The non-zero matrix elements are 


(Nica | Qka| Nica cae 1) = (Nka = 1] Qka| Nia) 
-_ V(Nykol2@). (2.13) 


The matrix elements of the quantities Py. = Qya differ from those of Qx. only by a 
factor tio. 

In subsequent calculations, however, it will be more convenient to replace the 
quantities Q,, and P,, by the linear combinations wQ,. + iPx., which have non-zero 
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matrix elements only for transitions Nxa > Na +1. We therefore define the opera- 
tors 


ee oe ee 
Cra = Vo) (WQka + 1Pka), 
(2.14) 


the classical quantities c,., ci, are the same, apart from a factor V(27/w), as the 
coefficients a,,, ait, in the expansion (2.4). The matrix elements of these operators 
are 


(Ne = 1] Cxa|Nxa) a (Na |€ ka] N ka a 1) 
= V Nqo- (2.15) 


The commutation rule for ¢,, and ¢,, is obtained by using the definitions (2.14) and 
the rule (2.9): 


Chal ka — CeCe = by (2.16) 


For the vector potential, we return to an expansion of the type (2.4), but with 
operator coefficients, writing it in the form 


A= DS. (Ca Ae + CcA®), (2.17) 
k, a 
where 
A/, a iker 
Axa = V (477) (Qe) a (2.18) 


The symbol e denotes the unit vectors in the direction of polarization of the 
oscillators; these vectors are perpendicular to the wave vector k, and for every k 
there are two independent polarizations. 

Similarly, for the operators E and H we write 


E — > (Cro Eka oF Chk), 
k, a 


(2.19) 
H ae > (Ca Hike + CxoH*,), 
k, a 
with 
Exo = iw Axes H... =nx Eras n= k/w. (2.20) 
The vectors A,. are mutually orthogonal, in the sense that 
4 2 
i : Ax. d°x = os Sova! Okk's (2.21) 
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For, if A, and Af, belong to different wave vectors, then their product contains a 
factor e’“*)'', which gives zero on integration over the volume; if they differ only 
in polarization, e : e’* =0, since the two independent directions of polarization 
are mutually orthogonal. Similar arguments apply to the vectors Ex, and H,.. They 
are conveniently normalized by imposing the condition 


iz i (Ex. : Ek + Hix. ’ Hi.) d°x = WO kk'Saa'- (2.22) 


Substituting the operators (2.19) in (2.10), and carrying out the integration by 
means of (2.22), we obtain the field Hamiltonian expressed in terms of the 
operators ¢, ¢": 

H a >» 50 (Cha ka + C ka Cka)- (2.23) 


k, a 


This operator is diagonal in the representation considered (the matrix elements of 
the operators ¢ and ¢” being given by (2.15)), and its eigenvalues are of course 
(2.12). 

In the classical theory, the field momentum ts defined as the integral 


p= 7 | EXH d's. 
Aor 


In changing to the quantum theory, we replace E and H by the operators (2.19), and 
thus easily find 


P= ¥ 5(Pi. + w’ Qin, (2.24) 
k, a 


in agreement with the familiar classical relationship between the energy and 
momentum of plane waves. The eigenvalues of this operator are 


P= >) K(Nxa +3). (2.25) 


The representation of operators by means of the matrix elements (2.15) is the 
‘occupation number representation”, corresponding to the description of the state 
of a system (the field) by specifying the quantum numbers N,, (the occupation 
numbers). In this representation the field operators (2.19), and therefore the 
Hamiltonian (2.11), act on the wave function of the system, expressed in terms of 
the numbers N,,; let this be ®(N,., t). The field operators (2.19) are not explicit 
functions of the time. This corresponds to the customary Schrodinger represen- 
tation of operators in non-relativistic quantum mechanics. The state of the system, 
@P(Nio, t), does depend on the time, and this dependence is governed by 
Schr6édinger’s equation, 


iaD/at = HO. 
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This description of the field is, by its nature, relativistically invariant, since it is 
based on the invariant Maxwell’s equations. But this invariance is not explicitly 
shown, primarily because the space coordinates and the time appear in the 
description in a highly asymmetric manner. 

In relativistic theory, it is convenient to put the description in a form which is 
more obviously invariant. To do so, we must use what is called the Heisenberg 
representation, in which the explicit time dependence is transferred to the opera- 
tors themselves (see QM, §13). Then the time and the coordinates will appear on an 
equal footing in the expressions for the field operators, and the state of the system, 
®, will depend only on the occupation numbers. 

For the operator A, the change to the Heisenberg representation amounts to 
replacing the factor e‘*’' in each term of the sum (2.17) by e““"" ®”, i.e. to regarding 
the A,, as the time-dependent functions 


Axa = V(477) con en, (2.26) 


This is easily proved by noticing that the matrix element of the Heisenberg 
operator for the transition i>f must include a factor exp {—i(E; — E;)t}, where E; 
and E; are the energies of the initial and final states (see QM, $13). For a transition 
in which N, decreases or increases by 1, this factor becomes e” or e’ respec- 
tively, a condition which is satisfied by effecting the change mentioned above. 
Henceforward, in discussing both the electromagnetic field and particle fields, 


we Shall always assume that the Heisenberg representation of operators is used. 
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We shall now further analyse the field quantization formulae obtained in §2. 

First of all, formula (2.12) for the field energy raises the following difficulty. The 
lowest energy level of the field corresponds to the case where the quantum 
numbers N,. of all the oscillators are zero; this is called the electromagnetic field 
vacuum state. But, even in that state, each oscillator has a non-zero ‘‘zero-point 
energy” equal to 3m. Summation over an infinite number of oscillators then gives an 
infinite result. Thus we meet with one of the ‘‘divergences’’ which are due to the 
fact that the present theory is not logically complete and consistent. 

So long as only the field energy eigenvalues are under discussion, we can 
remove this difficulty by simply striking out the zero-point oscillation energy, i.e. 
by writing the field energy and momentum ast 


E= > Nxo®, P= > Nyak. (3.1) 


+ This procedure can be formally carried out without contradiction if we agree to regard the 
products of operators in (2.10) as “‘normal” products, that is, as products in which the operators ¢* are 
always placed to the left of the operators ¢. Then formula (2.23) becomes 


A Ata 
H = > 06 ka Cha. 
5a 
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These formulae enable us to introduce the concept of radiation quanta or 
photons, which is fundamental throughout quantum electrodynamics.t We may 
regard the free electromagnetic field as an ensemble of particles each with energy w 
(=hw) and momentum k (=nhaw/c). The relationship between the photon energy 
and momentum is as it should be in relativistic mechanics for particles having zero 
rest-mass and moving with the velocity of light. The occupation numbers N,. now 
represent the numbers of photons having given momentum k and polarization e‘. 
The polarization of the photon is analogous to the spin of other particles; the exact 
properties of the photon in this respect will be discussed in §6 below. 

It is easily seen that the whole of the mathematical formalism developed in §2 is 
fully in accordance with the representation of the electromagnetic field as an 
ensemble of photons; it is just the second quantization formalism, applied to the 
system of photons.# In this treatment (see QM, 864), the independent variables are 
the occupation numbers of the states, and the operators act on functions of these 
numbers. The particle ‘‘annihilation” and ‘‘creation” operators are of basic im- 
portance; they respectively decrease and increase by one the occupation numbers. 
The ¢,. and ¢,, are operators of this kind: ¢,, annihilates a photon in the state k, a, 
and ¢,, creates a photon in that state. 

The commutation rule (2.16) corresponds to particles which obey Bose statis- 
tics. Photons, therefore, are bosons, as was to be expected, since the number of 
photons that can be in any one state must be unrestricted. The significance of this 
will be further discussed in §5. 

The plane waves A,,. (2.26) which appear in the operator A (2.17) as coefficients 
of the photon annihilation operators may be treated as the wave functions of 
photons having given momenta k and polarizations e‘”. This corresponds to an 
expansion of the W-operator in terms of the wave functions of stationary states of a 
particle in the non-relativistic second quantization formalism; however, unlike the 
latter, the expansion (2.17) includes both particle annihilation and particle creation 
operators. The meaning of this difference is explained in $12. 

The wave function (2.26) is normalized by the condition 


m4 
| 7a (Bal? + [Hue dx = (3.2) 


This is the normalization to ‘‘one photon in the volume V = 1”: the integral on the 
left is the quantum-mechanical mean value of the photon energy in the state having 
the given wave function. § The right-hand side of (3.2) is just the energy of a single 
photon. 

The ‘‘Schr6dinger’s equation” for the photon is represented by Maxwell’s 
equations. In the present case (when the potential A(r, t) satisfies the condition 


+ This concept is originally due to A. Einstein (1905). 

+ The application of the second quantization method to the theory of radiation was first worked out 
by P. A. M. Dirac (1927). 

§ It should be noted that the factor !/47 in the integral (3.2) is twice the usual factor 1/87 (2.10). 
This is ultimately due to the fact that the vectors Exe, Hx. are complex, whereas the field operators E, H 
are Hermitian. 
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(2.1)), this leads to the wave equation: 
a’A/at?- AA= 0. 


The ‘‘wave functions” of the photon, in the general case of arbitrary stationary 
states, are complex solutions of this equation, whose time dependence is given by 
the factor ee“. 

In referring to the photon wave function, we must again emphasize that this can 
not be regarded as the probability amplitude of the spatial localization of the 
photon, in contrast to the fundamental significance of the wave function in 
non-relativistic quantum mechanics. This is because, as has been shown in $1, the 
concept of the coordinates of the photon has no physical meaning. The mathema- 
tical aspect of this situation will be further discussed at the end of $4. 

The components of the Fourier expansion of the function A(r, t) with respect to 
the coordinates form the wave function of the photon in the momentum represen- 
tation; we denote this by A(k, t) = A(k) e “. For example, in a state with a given 
momentum k and polarization e’”, the wave function in the momentum represen- 
tation is given simply by the coefficient of the exponential factor in (2.26): 


(a) 
Axa(k’, a) = V(47r) VOD) SiSa'a (3.3) 


Since the momentum of a free particle is measurable, the wave function in the 
momentum representation has a more profound physical significance than that in 
the coordinate representation: it enables us to calculate the probabilities w,, of 
various values of the momentum and polarization of a photon in a specified state. 
According to the general rules of quantum mechanics, w,,. is given by the square of 
the modulus of the corresponding coefficient in the expansion of the function A(k’) 


in terms of the wave functions of states with given k and e™: 


2 
Wea & | > AB(K’, a’) A(k’)] , 
kK’, a! 


the proportionality coefficient depending on the way in which the functions are 
normalized. Substitution of (3.3) gives 


Wea * |e + A(k)]’. (3.4) 


Summation over the two polarizations gives the probability that the photon 
momentum is k: 


wy & |A(K)2. (3.5) 
§4. Gauge invariance 


The field potential in classical electrodynamics is well known to be subject to an 
arbitrary choice: the components of the 4-potential A, can undergo any gauge 
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transformation of the form 
A, > A, + 0.x; (4.1) 


where y is any function of coordinates and time (see Fields, $18). 

For a plane wave, if we consider only transformations which do not change the 
form of the potential (proportional to exp (—ik,x")), the freedom of choice reduces 
to the possibility of adding to the wave amplitude any 4-vector proportional to k’. 

This arbitrariness in the potential persists in the quantum theory, of course, 
where it relates to the field operators or to the wave functions of photons. In order 
not to prejudice the choice of the potentials, we must replace (2.17) by the 
corresponding expansion for the operator 4-potential, 


AY = SCeAt +e Ar), (4.2) 
k, a 


where the wave functions Af, are 4-vectors of the form 


e” ex” 


Ak = V (47) (Jo) 


e,e"* = —1, 
or more concisely, omitting the four-dimensional vector indices, 


A, = V(40) Ba ek og = 1, (4.3) 


Here the 4-momentum k" =(w, k) (and so kx =wt—k-r), and e is the unit 
polarization 4-vector.t 

If we consider only gauge transformations which do not alter the dependence of 
the function (4.3) on the coordinates and the time, the transformation must be 


é. > &, + xk,, (4.4) 


where y = x(k") is an arbitrary function. Since the polarization is transverse, it is 
always possible to choose a gauge such that the 4-vector e is 


e” = (0, e), e-k=0; (4.5) 


this will be called the three-dimensionally transverse gauge. In invariant four- 
dimensional form, this condition becomes the condition of four-dimensional 
transversality 


ek = 0. (4.6) 


+ The expression (4.3) is not in a fully relativistic-covariant (4-vector) form; this is because the 
normalization to a finite volume V = 1, used here, is not invariant. This is, however, of no fundamental 
importance, and is entirely compensated by the advantages of the normalization used. We shall see later 
that it allows a simple and straightforward deduction of actual physical quantities in the necessary 
invariant form. 


QE4-C 
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It should be noticed that this condition (like the normalization condition 
ee* = —1) is preserved by the transformation (4.4), since k? = 0. If the square of the 
4-momentum of a particle is zero, its mass must also be zero. This demonstrates 
the relationship between gauge invariance and the zero mass of the photon. Other 
aspects of the relationship will be discussed in §14. 

There can be no change in any measurable physical quantities under a gauge 
transformation of the wave functions of photons concerned in a process. In 
quantum electrodynamics this requirement of gauge invariance is of even greater 
importance than in the classical theory. We shall see many examples of the fact 
that gauge invariance is here, like relativistic invariance, a valuable heuristic 
principle. 

Gauge invariance is, in turn, closely related to the law of conservation of 
electric charge. This aspect will be discussed in $43. 

It has already been mentioned in §3 that the coordinate wave function of the 
photon cannot be interpreted as the probability amplitude of its spatial localization. 
Mathematically, this is shown by the impossibility of constructing from the wave 
function any quantity which has even the formal properties of a probability 
density. Such a quantity would have to be expressed as a positive-definite bilinear 
combination of the wave function A, and its complex conjugate. Moreover, it 
would have to satisfy certain conditions of relativistic covariance by being the time 
component of a 4-vector. This is because the continuity equation, which expresses 
the conservation of the number of particles, 1s given in four-dimensional form by 
the vanishing of the divergence of the current 4-vector. The time component of the 
current is here the particle localization probability density; see Fields, $29. On the 
other hand, by the condition of gauge invariance, the 4-vector A, could appear in the 
current only as the antisymmetric tensor F,, = 0,A, — 0,A, = —i(k,A, — k,A,). Thus 
the current 4-vector would have to be a bilinear combination of F,, and F%, (and 
the components of the 4-vector k,,). But such a 4-vector cannot be formed, since 
every expression (such as k*F*,F 4) which satisfies the conditions stated is zero by 
the transversality condition (k*F,, =0), and in any case could not be positive- 
definite, since it contains odd powers of the components k,,. 


§5. The electromagnetic field in quantum theory 


The description of the field as an ensemble of photons is the only description 
that fully accords with the physical significance of the electromagnetic field in 
quantum theory. It replaces the classical description in terms of field strengths. 
These appear in the mathematical formalism of the photon picture as second 
quantization operators. 

The properties of a quantum system are Known to be similar to the classical 
properties when the quantum numbers defining the stationary states of the system 
are large. For a free electromagnetic field (in a given volume) this means that the 
oscillator quantum numbers, ie. the photon numbers N,,, must be large. In this 
respect the fact that photons obey Bose statistics is of great importance. In the 
mathematical formalism of the theory, the relationship of the Bose statistics to the 
properties of the classical field is Shown by the commutation rules for the operators 
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Cas Ck. When the N,., are large, and the matrix elements of these operators are 
therefore large also, we may neglect unity on the right-hand side of the com- 
mutation rule (2.16), obtaining 


these operators thus become the commuting classical quantities c,, and cit,, which 
determine the classical field strengths. 

The condition for the field to be quasi-classical needs to be made more precise, 
however, since, if all the numbers N,,, are large, the energy of the field is certainly 
infinite on summation over all the states k, a, and the condition then becomes 
meaningless. 

A physically meaningful statement of the problem as to the conditions for a 
quasi-classical field can be based on a consideration of values of the field averaged 
over some short time interval At. If the classical electric field E (or magnetic field 
H) is represented as a Fourier integral expansion with respect to the time, then, 
when it is averaged over the time interval At, only those Fourier components 
whose frequencies are such that wAt <1 will make a significant contribution to the 
mean value E, since otherwise the oscillating factor e almost vanishes on 
averaging. Thus, in determining the condition for the averaged field to be quasi- 
classical, we need consider only those quantum oscillators whose frequency 
w <1/At. It is sufficient that the quantum numbers of these oscillators should be 
large. 

The number of oscillators having frequencies between zero and w ~ 1/At (fora 
volume V = 1) is, in order of magnitude,t 


—twt 


(w/c) ~ 1/(cAty’. (5.1) 


The total field energy per unit volume is proportional to E’. Dividing this by the 
number of oscillators and by some mean value of the energy of a single photon 
(~hw), we find as the order of magnitude of the numbers of photons 


Nx sad E’°c3/ho’‘. 
With the condition that this number should be large, we obtain the inequality 
IE] > V(hc)/(c At)’. (5.2) 


This is the required condition, which allows the field averaged over time 
intervals At to be treated as classical. We see that the field must reach a certain 
strength, which increases as the averaging time At decreases. For variable fields, 
this time must not, of course, exceed the time during which the field changes 
appreciably. Thus variable fields, if sufficiently weak, can never be quasi-classical. 
Only for static (time-independent) fields can we make At ~~, so that the right-hand 
side of the inequality (5.2) tends to zero. Thus a static field is always classical. 


+t In this section, ordinary units are used. 
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It has already been mentioned that the classical expressions for the elec- 
tromagnetic field as a superposition of plane waves must be regarded in quantum 
theory as operator expressions. These operators, however, have only a very limited 
physical meaning. A physically meaningful field operator would have to give zero 
field values in the photon vacuum state, whereas the mean value of the squared 
field operator E’ in the ground state, which is the same as ihe zero-point energy of 
the field apart from a factor, is infinite; by the ‘‘mean value” is meant the 
quantum-mechanical mean value, i.e. the corresponding diagonal matrix element of 
the operator. This infinity cannot be avoided even by any formal cancelling 
operation (as was done for the field energy), since here this would have to be 
carried out by means of some appropriate modification of the operators E and H 
themselves (not their squares), which is impossible. 


§6. The angular momentum and parity of the photon 


The photon, like any other particle, can possess a certain angular momentum. In 
order to determine the properties of this quantity for the photon, let us first recall 
the relationship between the properties of the wave function of a particle and the 
angular momentum of the particle, in the mathematical formalism of quantum 
mechanics. 

The angular momentum j of a particle consists of its orbital angular momentum 
1 and its intrinsic angular momentum or spin s. The wave function of a particle 
having spin s is a symmetrical spinor of rank 25, i.e. is a set of 2s + 1 components 
which are transformed into definite combinations of one another when the coor- 
dinate axes are rotated. The orbital angular momentum is related to the way in 
which the wave functions depend on the coordinates: states with orbital angular 
momentum | correspond to wave functions whose components are linear com- 
binations of the spherical harmonic functions of order I. 

The consistent distinguishability of the spin and the orbital angular momentum 
therefore requires that the “‘spin’’ and ‘“‘coordinate”’ properties of the wave func- 
tions should be independent of each other: the dependence of the spinor com- 
ponents on the coordinates (at a given instant) must not be subject to any 
additional restrictions. 

In the momentum representation of the wave functions, their dependence on 
the coordinates is replaced by their dependence on the momentum k. The photon 
wave function (in the three-dimensionally transverse gauge) is the vector A(k). A 
vector is equivalent to a spinor of rank 2, and in this sense the photon might be said 
to have spin 1. But this vector wave function satisfies the transversality condition, 
k- A(k) = 0, which is a further condition imposed on the function A(k). Con- 
sequently, this function cannot be arbitrarily specified as regards every component 
of the vector at the same time, and therefore the orbital angular momentum and the 
spin cannot be strictly distinguished. 

The definition of the spin as the angular momentum of a particle at rest is also 
inapplicable to the photon, because there is no rest frame for a photon, which 
moves with the velocity of light. 
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Thus only the total angular momentum of the photon has a meaning. It is. 
moreover, obvious that this total angular momentum must be integral, since the 
quantities describing the photon do not include any spinors of odd rank. 

The state of a photon, like that of any particle, is also described by its parity. 
which refers to the behaviour of the wave function under inversion of the 
coordinates (see QM, $30). In the momentum representation, the change of sign of 
the coordinates is replaced by the change of sign of all the components of k. The 
effect of the inversion operator P on a scalar function (k) is simply to produce 
this change of sign: Pé(k) = 6(—k). When it is applied to a vector function A(k), we 
must also take into account the fact that the reversal of the directions of the axes 
changes the sign of all the components of the vector; hencet 


P A(k) = —A(-k). (6.1) 


Although the separation of the angular momentum of the photon into the orbital 
angular momentum and the spin has no physical meaning, it is nevertheless 
convenient to define a “spin” s and an “orbital angular momentum” | as formal 
auxiliary quantities which express the transformation properties of the wave 
function under rotations: the value s =1 corresponds to the fact that the wave 
function is a vector, and the value of | is the order of the spherical harmonics 
which occur in the wave function. Here we are considering the wave functions of 
states in which the photon angular momentum has a definite value; for a free 
particle, these are spherical waves. The number I, in particular, defines the parity of 
the photon state, which is 


P =(-1)". (6.2) 


In the same way, the angular momentum operator j may be represented as the 
sum §+]. The operator j is related to the operator of an infinitesimal rotation of the 
coordinates, or, in the present case, to the action of this operator on a vector field. 
In the sum §+1, the operator § acts on the vector index, transforming the 
components of the vector into combinations of one another. The operator I acts on 
these components as functions of the momentum (or of the coordinates). 

We may count the number of states (with a given energy) which are possible for 
a given value j of the photon angular momentum, ignoring the trivial (2j + 1)-fold 
degeneracy with respect to the directions of the angular momentum. 

When I and s are independent, this calculation is made by simply counting the 
number of ways in which the angular momenta I and s can be added, according to 
the rules of the vector model, so as to obtain the required value of j. For a particle 


+ We shall choose to define the parity of a state according to the effect of the inversion operator on a 
polar vector, such as A (or the corresponding electric vector E = iwA). This differs in sign from the 
effect on the axial vector H = ik x A, since the direction of such a vector is unaltered by inversion: 


PH(k) = H(—k). 
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with spin s = 1, and a given non-zero value of j, this would give three states, with 
the following values of | and the parity P: 


| = j P =(-1'= (1); 


l=j+i, P=(-1"=C1)y. 


If j = 0, however, only one state is obtained, with | = 1 and parity P = +1. 

In this calculation the condition that the vector A is transverse has not been 
taken into account; all its three components have been assumed to be in- 
dependent. We must therefore subtract, from the numbers of states found above, 
the numbers of states which correspond to a longitudinal vector. This vector may 
be written in the form kd(k), whence we see that its three components are 
equivalent, as regards their transformation properties (under rotations), to a single 
scalar @.t We can therefore say that the extra state which is incompatible with the 
transversality condition would correspond to the state of a particle having a scalar 
wave function (spinor of rank 0), i.e. having “‘spin zero”’.£ The angular momentum j 
of this state is therefore equal to the order of the spherical harmonics which occur 
in d@. The parity of the state as a state of the photon is determined by the action of 
the inversion operator on the vector function kd: 


P(k¢) = —(-k)¢(-k) = -Dikd(y), 


and is therefore (—1). Thus we must subtract one from the number of states found 
above which have the parity (—1), i.e. two for j#0 and one for j = 0. 

The conclusion is, then, that when the photon angular momentum j is non-zero 
there is one even state and one odd state. When j = 0, no states exist. This means 
that a photon cannot have zero angular momentum; j therefore takes only the 
values 1,2,3,.... The impossibility of } =0 is evident a priori, since the wave 
function of a state with zero angular momentum must be spherically symmetrical, 
and this cannot be true for a transverse wave. 

The following terminology is customary to denote the various states of the 
photon. A photon with angular momentum j and parity (—1)! is called an electric 
2'-pole (or Ej) photon; one with parity (—1)'*! is called a magnetic 2/-pole (or Mj) 
photon. For example, an odd state with j =1 corresponds to an electric dipole 
photon, an even state with j = 2 to an electric quadrupole photon, and an even state 
with j = 1 to a magnetic dipole photon.§ 


+ This is because the transformation of a quantity under rotation is a transformation at a given point, 
i.e. for a given value of k. Under such a transformation, k@(k) is unchanged, i.e. it behaves as a scalar. 

+t It should be again emphasized that this does not refer to a state of an actual particle. The 
calculation given here is a formal one, and amounts mathematically to a classification of the set of 
quantities which are transformed into combinations of one another, in terms of the irreducible 
representations of the rotation group. 

§ This nomenclature corresponds to the terminology of classical radiation theory; we shall see later 
(§§46, 47) that the emission of electric and magnetic photons is governed by the electric and magnetic 
moments of a system of charges. 
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87. Spherical waves of photons 


Having ascertained the possible values of the photon angular momentum, we 
must now determine the corresponding wave functions.f 

Let us first consider the formal problem of determining vector functions which 
are eigenfunctions of the operators i and j,, without deciding as yet which of these 
functions will appear in the desired photon wave functions, and without taking 
account of the transversality condition. 

We shall look for the functions in the momentum representation. In this 
representation, the coordinate operator is f = id/dk (see QM, (15.12)). The orbital 
angular momentum operator is 


I = xk = —ik x 0/dk, 


and therefore differs from the angular momentum operator in the coordinate 
representation only in that r is replaced by k. The solution of the problem is thus 
formally identical in the two representations. 

Let the required eigenfunctions be denoted by Yj, and referred to as spherical 
harmonic vectors. They must satisfy the conditions 


PY in = J+ DY im, 1:Y¥ im = MY im, (7.1) 


the z-axis being in a specified direction in space. We shall show that these 
conditions are satisfied by any function of the form aYjm, where a is any vector 
formed from the unit vector n=k/q@, and Y;, are the ordinary (scalar) spherical 
harmonic functions. The latter will everywhere be defined as in QM, §28: 


Yim (mn) = (— 12" j! V Ca aL P'"(cos 6) e'"*, (7.2) 


4n(1+ |m))! 


where 6 and ¢ are the spherical polar angles of the direction n.t+ 
The proof is based on the commutation rule 


{I;, ay} = 1&1 


(QM, (29.4)). The right-hand side may be written as — §;a,, where § is the operator of 
spin 1; the effect of this operator on a vector function is in fact given by 
$a, = — ima; (see QM, $57, Problem 2). Hence 


A A 


lia, — Axl; = — §iAk, 


+ This problem was first discussed by W. Heitler (1936). The solution given here is due to V. B. 
Berestetskii (1947). 
+ For future reference, the value of the function when 6 = 0 (n is along the z-axis) is 


Vey ei ee oes, (7.2a) 
At 
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and therefore 
fide = (li + 8:)ay = axl. 
Consequently 
P@Yim)=aPYin, — J(aYim) = al,Yim- 


Since the spherical harmonic Yj, is the eigenfunction of the operators IP? and 1, 
which corresponds to the respective eigenvalues j(j+1) and m, we arrive at 
equations (7.1). 

The three essentially different types of spherical harmonic vectors are obtained 
by taking as the vector a the three following vectors:t 


Vn NON js (7.3) 
VLG + DY VG + DY 


The spherical harmonic vectors are thus defined as 


1 . 
YO =—— V.Y= P =(-1); 
m—uq+ Dy 37 CD 
Ym=anx Vie, Pay: (7.4) 
Y¥2 =nYon, P =(-1). 


The parity P is also shown for each vector. The three vectors are orthogonal, Y% 
being longitudinal and Y$‘) and Y$"" transverse with respect to n. 

The spherical harmonic vectors can be expressed in terms of the scalar 
spherical harmonics: Y{" in terms of spherical harmonics of the order | =j only, 
and Y‘«) and Y3? in terms of those of order | = j + 1. This is immediately evident on 
comparing the parities shown in (7.4) with the parity (—1)'*’ of a vector field in 
terms of the order of the spherical harmonics concerned. 

The spherical harmonic vectors of any one type are orthonormal: 


| Yin . VY Fin do = 05 Omm'- (7.5) 


For the vectors Yj, this is obvious from the normalization condition for the 


+ The operator V,=|k|Vx, and acts on functions which depend only on the direction of n. In 
spherical polar coordinates its two components are 


0 1 a 
Va = Ge sin 6 aa 
The operator denoted below by Anis the angular part of the Laplacian operator: 


ee sp (sin <5) + | a" 
"sin 6 26 a6) sin? @ ad” 
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spherical harmonics Yjm. For the vectors Y{) the normalization integral is 


____! fy. , 
gen) eae a+ Sunes ie des 


and, since A,Yjm = —JG +1) Yim, equation (7.5) follows. The normalization for the 
vectors Y$,,’ leads to a similar integral. 

The spherical harmonic vectors (7.4) could also be derived without the direct 
verification of equations (7.1) that has been carried out above, using only general 
arguments concerning the transformational properties of functions. In 86, these 
arguments were employed to show that a vector function nd corresponds to an 
angular momentum j which is the same as the order of the spherical harmonics 
occurring in d. If we put simply @ = Yjm, the function n@ will also correspond to a 
definite value m of the angular-momentum component. Thus we derive at once the 
spherical harmonic vectors Y$). But the discussion of transformational properties 
in §6 is unaffected if the factor n in the product n@ is replaced by the vector V, or 
by nx V,. This leads to the other two types of spherical harmonic vectors. 

Let us now consider the photon wave functions. For an electric photon of type 
Ej, the parity of the vector A(k) is (—1)'. The spherical harmonic vectors Y‘) and 
Y‘, possess this parity, but only the former satisfies the transversality condition. 
For a magnetic photon of type Mj, the parity of the vector A(k) is (—1)'*'; only Y{”? 
has this parity. The wave functions of a photon having a given angular momentum 
J, component thereof m, and energy o, are therefore 


Awim (k) = 5 3 ((k| — @)Y jm (n), (7.6) 
where Yj, must be taken as Y‘) and Y‘"? for electric and magnetic photons 
respectively. The given value of the energy is taken into account by the factor 


5(|k| — w). 
The functions (7.6) are normalized by the condition 


aay | wo! A*jm(K) + Aujim(k) 2k = @6(@' — ©) 555mm’. (7.7) 


For wave functions of the coordinate representation, the condition (7.7) is 
equivalent to the conditiont 


+ | {ES jm () : Ewin (¥) a2 Hoy (Pr) : Hu jm(r)} dx = wd(w' a @ ) 8; Omm’: (7.8) 
the integral on the left, when written in terms of the potentials, is 
1 * ! d3 
Dae Adm (r) . Awim(r)@ W xX, 


+ This condition is of the same type as (2.22). The factor 6(w’—) on the right-hand side appears 
because we are now considering a field (spherical wave) throughout infinite space instead of in the finite 
volume V = 1. 
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and with 


d’k 
(27) 


Awim(r) = | Awim(k) ee" 


3 (7.9) 
Abrm(t)= [ Abjatk) oe EK, 
@jm ojym (Qa) 


the integral over d*x gives the delta function (27)°8(k' —k). This is eliminated by 
integrating over d°k, and the integral reduces to (7.7). 

So far, we have assumed that the potentials are in the transverse gauge, for 
which the scalar potential ® = Q. In certain applications, however, other gauges of 
the spherical wave may be more convenient. 

The transformation of the potentials that can be conducted in the momentum 
representation is 


A>A+nf(k), &>@+ f(b, 


where f(k) is an arbitrary function. In the present case we shall choose it so that 
the new potentials are expressed in terms of the same spherical harmonics and 
again have a definite parity. For an electric photon, these conditions limit the 
choice of potentials to the following: 


é€ 4 ; é€ 
AGSmu(k) = pp 8(|k| — @ (VJ + Cn Yim), 
(7.10) 
),(k) = #75 8(|k| — @ CY 
aim(K) = ~ 373 3(|k] — © )CY im, 


where C is an arbitrary constant. For a magnetic photon, this addition to A‘”(k) 
would leave it without a definite parity, and (7.6) is therefore the only possible 
choice under these conditions. 

The probability that a photon having a definite angular momentum and parity 
will be recorded as moving in a direction n which lies in the solid-angle element do 
is, according to (3.5) and (7.6), 


w(n) do = |Y(n)/° do. (7.11) 


This is the expression for an E photon, but, since /Y$??|’ =|Y§’, the probability 
distribution w(n) is the same for both types of photon. 

The squared modulus |Y‘)|? is independent of the azimuthal angle @, since the 
factors e*'’"® in the spherical harmonic functions cancel. The probability dis- 
tribution w(n) is therefore symmetrical about the z-axis. Moreover, since each of 
the spherical harmonic vectors has a definite parity, their squared moduli are 
unaffected by inversion, i.e. by the change of polar angle 0 > 7 — 6; this means that 
the expansion of the function w(@) in Legendre polynomials will contain only those 
of even order. The determination of the expansion coefficients is equivalent to a 
calculation of the integrals of products of three spherical harmonic functions, 
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followed by summation over components. These processes are effected by means 
of the formulae derived in QM, §§107, 108, and the result is 


(ye QU+ DS j j2n\(i  j 2n\fi j2n 
w(6) = (DLS 4m + Dy 0 Vn —m 0 Ny 7 1 pPzn(cos 8).(7-12) 


Finally, we shall give the expressions for the components of the spherical 
harmonic vectors as expansions in terms of spherical harmonic functions. To do so, 


we Shall use the “‘spherical components”’ of a vector, defined as in QM, $107. These 
components f, of a vector f are 


i i 
fo= if: fai = 75 (fs + ify), fai = 75 i — ify). (7.13) 
In terms of the “spherical unit vectors’’, 
e = je, gai ee (e) + je), gn: 75 (e®) — ie), (7.14) 


where e®»” are unit vectors in the direction of x, y and z, we have 
LS > (— 1) *Ff_,e, a a 1) “f eV 
A 
=f-e, (7.15) 


The spherical components of the spherical harmonic vectors are expressed in 
terms of 3j-symbols and spherical harmonic functions as follows: 


j+m+At e ‘ ] a 1 1 ] 
ere yaa. ok et 
; j-1 1 j ) | 
sm Vj oF D() + r —} —m Yas 
; ] 1 ] 
(— [yt = —V(2j + D(, on ay sl Miah (7.16) 


J+1 1 ] 


m+r —dA Vitec 


(1m ry), = VG+ 1)( 
m+A —-A —-m 
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These formulae are derived in the following way. Each of the three spherical 
harmonic vectors is of the form Yjm=aYjm, where a is one of the three vectors 
(7.3). Hence 


Vain = > (lm'|aljm) Yim’; 
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and the problem is equivalent to that of finding the matrix elements of the vector a 
with respect to the eigenfunctions of the orbital angular momentum. According to 
QM, (107.6), we have 


lL dsj 
m' YX m 


(imam) = (ym (_ )dllal, 


where jmax 1S the larger of | and j. It is therefore sufficient to know the non-zero 
reduced matrix elements (I|la||j). These are given by the formulae 

(1 — tall) = (In| — 1)* = iV, 

(1|Valll — 1) =i(l— 1), 

(1 — 1]Valll) = i + V1, 

(In x Vall) = iV (LC + 1)(21 + 1). 


(7.17) 


§8. The polarization of the photon 


The polarization vector e acts for the photon as the “spin part” of the wave 
function (with the limitations stated in 86 in connection with the concept of photon 
Spin). 

The various cases which can occur with regard to the polarization of the photon 
are identical with the possible types of polarization of a classical electromagnetic 
wave (see Fields, §48). 

Any polarization e can be represented as a superposition of two mutually 
orthogonal polarizations e“ and e® (e? - e* = 0), chosen in some specified manner. 
In the resolution 


e= ee) + e,e%, (8.1) 


the squares of the moduli of the coefficients e,; and e, determine the probabilities 
that the photon has polarization e“ and e” respectively. 

These polarizations may be taken to be two mutually perpendicular linear 
polarizations. We can also resolve any polarization into two circular polarizations 
having opposite directions of rotation. The vectors of the right-hand and left-hand 
circular polarizations will be denoted by e“” and e“” respectively; in coordinates &, 
n, ¢, with the ¢-axis in the direction of the photon n=k/a, 


gers p.: A (+ie), eMM= ap (e® — ie”), (8.2) 


The possibility that the photon has two different polarizations (for a given 
momentum) is equivalent to the statement that each eigenvalue of the momentum is 
doubly degenerate. This property is closely related to the fact that the mass of the 
photon is zero. 
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A freely moving particle with non-zero mass always has a rest frame. The 
intrinsic symmetry properties of the particle, as such, will evidently appear in this 
particular frame of reference. Symmetry with respect to all possible rotations about 
the centre (i.e. with respect to the entire spherical symmetry group) must be 
considered. The property which describes the symmetry of the particle with 
respect to this group is its spin s; this determines the degree of degeneracy, the 
number of different wave functions which are transformed into linear combinations 
of one another being 2s +1. In particular, a particle having a vector (three- 
component) wave function has spin 1. 

If the mass of the particle is zero, however, there is no rest frame, since it 
moves with the velocity of light in every frame of reference. For such a particle, 
there 1s always a distinctive direction in space, the direction of the momentum 
vector k (the ¢-axis). In such a case there is clearly no symmetry with respect to 
the whole group of rotations in three dimensions, but only axial symmetry about 
the preferred axis. 

When there is axial symmetry, only the helicity of the particle is conserved, i.e. 
the component of its angular momentum along the ¢-axis, which we denote by A.t 
If we also impose the condition of symmetry under reflections in planes passing 
through the ¢-axis, the states differing in the sign of » will be mutually degenerate, 
and when A#0 there is therefore twofold degeneracy.t The state of a photon 
having a definite momentum in fact corresponds to one type of these doubly 
degenerate states. It is described by a “‘spin”’ wave function which is a vector e in 
the éy-plane; the two components of this vector are transformed into combinations 
of each other by any rotation about the ¢-axis and by any reflection in a plane 
passing through that axis. 

The various cases of the polarization of the photon are in a certain relationship 
to the possible values of its helicity. The relationship can be deduced from the 
formulae in QM (57.9), which connect the components of a vector wave function 
with those of the equivalent spinor of rank two.§ Vectors e with only the component 
é;—1€, or @ +ie, non-zero correspond to the components 4 = +1 or —1 respec- 
tively; these are e=e” and e=e”. In other words, the values \ = +1 and —1 
correspond to right-hand and left-hand circular polarization of the photon. In §16 
the same result will be derived by direct calculation of the eigenfunctions of the 
spin component operator. 

Thus the component of the photon angular momentum along the direction of its 
motion can have only the two values +1; the value zero is not possible. 

A state of the photon having a definite momentum and polarization is a pure 
state, in the sense defined in QM, §14; it is described by a wave function, and 
corresponds to a complete quantum-mechanical description of the state of the 
particle (the photon). ‘““Mixed”’ photon states are also possible, which correspond to 
a less complete description by a density matrix only, not a wave function. 


+ This is to be distinguished from m, the component of the angular momentum in a specified 
direction in space (the z-axis), which was used in §7. 

t This is the method of classifying the electron terms of the diatomic molecule (QM, §78). 

§ It is the contravariant spinor components that correspond to the components of the wave function 
as the probability amplitudes of various values of the angular momentum of the particle (which are here 
considered). 
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Let us consider a state of the photon which is mixed as regards its polarization, 
but corresponds to a definite value of the momentum k. In such a state (called a 
state of partial polarization), a “‘coordinate”’ wave function exists. 

The polarization density matrix of the photon is a tensor pug of rank two, in a 
plane perpendicular to the vector n (the &y-plane; the suffixes a, B take only two 
values). This tensor is Hermitian: 


Pop = Ppa» (3.3) 
and is normalized by the condition 
Poo = Pir + Px = 1. (8.4) 


From (8.3), the diagonal components p;; and py are real, and either is given in terms 
of the other by (8.4). The component p,;. is complex, and p); = p*. The density 
matrix therefore involves three real parameters. 

If the polarization density matrix is known, we can find the probability that the 
photon has any given polarization e. This probability is determined by the “‘pro- 
jection” of the tensor p,g on the direction of the vector e, i.e. by the quantity 


Papeh ep. (8.5) 


For example, the components pi; and py are the probabilities of linear polarizations 
along the € and 7 axes. The probability of the two circular polarizations is given by 
taking the projections along the vectors (8.2): 


a1 + i(pi2 — P21)). (8.6) 


The properties of the tensor p,g are essentially the same as those of the tensor 
Jug Which describes partially polarized light in the classical theory (see Fields, §50). 
Some of these properties are the following. 

For a pure state with a definite polarization e, the tensor p,g reduces to products 
of components of the vector e: 


Pop = Cals (8.7) 


and the determinant |p,g|=0. In the opposite case of an unpolarized photon, all 
directions of polarization are equally probable, i.e. 


Pap = + Sap, (8.8) 
and |pap| = 4. 
In the general case, it is convenient to describe the partial polarization by means 


of three real Stokes parameters &,, &, &,f in terms of which the density matrix can 


+t These are not to be confused with the é-axis. 
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be written 


_1/1+& &&—-i& 
poe 3 6 + ie l—-& ) aor) 


All three parameters take values between —1 and +1. In the unpolarized state, 
E, = & = & =0; for a completely polarized photon, é{+ €5+ €3 = 1. 

The parameter &3 describes the linear polarization along the € or 7 axis; the 
probability that the photon is linearly polarized along these axes is respectively 
(1+ &) and 3(1 — é;). The values &;= +1 and —1 therefore correspond to complete 
polarization in these directions. 

The parameter &, describes the linear polarization along directions at angles 
@ = +47 to the €-axis. The probability that the photon is linearly polarized along 
these directions is respectively 3(1+é,) and 3(1—€&,). This is easily shown by 
projecting the tensor p,, on the directions e = (1, +1)/V2. 

Finally, the parameter &, represents the degree of circular polarization: accord- 
ing to (8.6), the probability that the photon has right-hand or left-hand circular 
polarization is respectively 3(1+&) and 3(1—é,). Since these two polarizations 
corresperd to helicities A = +1, it is clear that & is the mean value of the helicity of 
the photon. Moreover, for a pure state with polarization e, 


é =ieXet-n. (8.10) 


The quantities € and V(é{+ €3) are invariant under Lorentz transformations 
(see Fields, §50). 

We shall later encounter the problem of the behaviour of the Stokes parameters 
under the operation of time reversal. It is easily seen that they are invariant. This 
property is evidently independent of the state of polarization, and therefore need 
be proved only for a pure state. In quantum mechanics, time reversal corresponds 
to replacing the wave function by its complex conjugate (QM, 818). For a 
plane-polarized wave, this implies the changest 


k—- —k, e>-—e*. (8.11) 
Under this transformation, the symmetrical part 
2(€,e% + ege*) 


of the density matrix is unchanged, and therefore so are €, and &. The fact that €, 
is unchanged by this transformation is seen from (8.10), and is also evident from 
the fact that € 1s the mean value of the helicity: the helicity is the component of 
the angular momentum j in the direction of n, i.e. the product j-n, and both these 
vectors change sign under time reversal. 

+ The change in the sign of e is necessary because time reversal changes the sign of the vector 
potential of the electromagnetic field. The scalar potential, however, does not change sign, and the effect 
of time reversal on the 4-vector e is therefore as follows: 


(€0, e) > (e, —e*). (8.11a) 
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In later calculations, we shall need the photon density matrix written in 
four-dimensional form, i.e. as a certain 4-tensor p,,. For a polarized photon 
described by the 4-vector e,, this tensor can naturally be defined as 


Puv = Cy€s- (8.12) 


In the three-dimensionally transverse gauge, e = (0, e), and if one of the spatial 
coordinate axes is taken to be along n the non-zero components of this 4-tensor are 


the same as (8.7). 
For an unpolarized photon the three-dimensionally transverse gauge cor- 
responds to a tensor p,, having components 


Pik = (Six — nin), Poi = pio = Poo = 9; (8.13) 


if one of the axes is in the direction of n, the result is again (8.8). It would, 
however, be inconvenient to use the tensor p,, in this three-dimensional form. But 
a gauge transformation can be applied, which for the density matrix is 


Puv > Puv + Xukv + Xk, (8.14) 
where the x, are arbitrary functions. Putting 


Xo = 1/4, Xi = kl4o’, 
we obtain instead of (8.13) the simple four-dimensional expression 


Puv = 72 uv (8.15) 


The four-dimensional form of the density matrix for a partially polarized photon is 
easily found by first writing the two-dimensional tensor (8.9) in three-dimensional 
form: 


pix = 2(e)P ey? + ePe?) +3& (Eire? + ePel?) — 
1°: 1 
— 21€,(e Pep — eel) +r& (ere, — ee), 
where e” and e” are unit vectors along the € and y axes. The required generaliza- 


tion is obtained on replacing these 3-vectors by real space-like unit 4-vectors e”, 
e” which are orthogonal to each other and to the photon 4-momentum k: 


eM = p24 
ee = ¢ (8.16) 


3 
eK = eMk =0. 


In one particular frame of reference, e“ =(0, e) and e@=(0, e). Thus the 
four-dimensional density matrix of the photon is 


mee | 1 1 2) ,(2 1 1,2 2),(01 
Pay ee, Pe ne, ) F2b(ee, +e, e,)= 


Fie (ePe? — ee) +4 (ee — ee, (8.17) 
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The convenience of any specific choice of the 4-vectors e”, e® depends on the 
conditions of the problem concerned. 

It must be noted that the conditions (8.16) do not uniquely define the choice of 
e” and e”. If a 4-vector e, satisfies these conditions, then so does any 4-vector 
e, + xk,, since k* = 0. This non-uniqueness occurs because the density matrix is not 
invariant under gauge transformations. 

The first term in (8.17) corresponds to the unpolarized state. According to (8.15), 
it can therefore be replaced by 3 Lu This change is again equivalent to a certain 
gauge transformation. 

The following formal device is useful in calculations with 4-tensors of the form 
(8.17) expressed in terms of two independent 4-vectors. We write the tensor (8.17) 
in the form 


3 

= b b 

i= » | p\ ree. 
a,b= 


(25) as a two-rowed matrix: 


aie a 
2 ° 

p2) yp 

This, like any two-rowed Hermitian matrix, can be written in terms of four 


independent two-rowed matrices: the Pauli matrices o,, aj, o, and the unit matrix 1. 
The result is 


and the coefficients p 


p =3(1+&-0), E = (&, &, &3), (8.18) 


as is easily seen by direct comparison with (8.17), using the expressions (18.5) for 
the Pauli matrices. The combination of the three quantities €, &, & into a “‘vector”’ 
€ is, of course, purely formal and is done only for convenience of notation. 


PROBLEM 


Write the photon density matrix for the case where the coordinate “taxes” are the circular unit 
vectors (8.2). 

SOLUTION. The components p’ug of the tensor relative to the new axes (a, B = +1) are obtained by 
projecting the tensor (8.9) on the unit vectors (8.2): 


py = popes *es”, P\,-1 = piste: Pee, a) 
p'=5 1+ & ae) 
2\—&3 — 161 1-& } 


§9. A two-photon system 
By arguments similar to those in 86, we can calculate the number of possible 


states in a more complicated case, that of a system of two photons (L. Landau, 
1948). 


QE4~-D 
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We shall consider the photons in their centre-of-mass system; their momenta 
are k, = —k,=k.t The wave function of the two-photon system (in the momentum 
representation) can be written as a three-dimensional tensor of rank two Ai;(n), 
formed by a bilinear combination of the vector wave functions of the two photons; 
each of the suffixes of this tensor corresponds to one of the photons (n being a unit 
vector in the direction of k). The transversality of each photon is expressed by the 
orthogonality of the tensor A, to the vector n: 


Ain) = 0, Ann = 0. (9.1) 


An interchange of the photons corresponds to an interchange of the suffixes of 
the tensor A;, and a simultaneous change in the sign of n. Since photons obey Bose 
statistics, we have 


Aix(—n) = Axi(n). (9.2) 


The tensor A, is not in general symmetrical with respect to its suffixes. It can 
be resolved into symmetric and antisymmetric parts: Aj = s+ dix. The equation 
(9.2), and the orthogonality conditions (9.1), must evidently apply to each part 
separately. Hence we have 


Sik(—M) = Six (Qn), (9.3) 
ix(—n) = — a(n). (9.4) 


Inversion of the coordinates does not affect the sign of the components of a tensor 
of rank two, but changes the sign of n. From (9.3), therefore, the wave function sj 
is symmetrical under inversion, i.e. it corresponds to even states of the two-photon 
system, while the wave function a, corresponds to odd states. 

An antisymmetric tensor of rank two is equivalent (dual) to a certain axial 
vector a, whose components are given in terms of those of the tensor by 


235M 
Qi = 2 Cin Ak, 


ex, being the antisymmetric unit tensor; see Fields, 86. The orthogonality of the 
tensor a, and the vector n implies that the vectors a and n are parallel.t We can 
therefore write a=nd(n), where @ is a scalar; according to (9.4), we must have 
a(—n) = —a(n), and therefore 


o(-n) = o(n). 


This equation signifies that the scalar @ can be formed linearly only from spherical 
harmonic functions of even order L (including order zero). 


+ This frame of reference always exists except in the case of two photons moving in the same 
direction. The total momentum k; +k: and the total energy w, + w2 of such photons are related in the 
same way as those of a single photon, and there is therefore no frame of reference in which ki + k2 = 0. 

+t For ax = exa, and the orthogonality condition gives ain, = ean, = (n x a); = 0. 
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We see that the transformation properties of the antisymmetric tensor a; under 
rotations are equivalent to those of a single scalar (cf. the second footnote to $6). 
When the latter is assigned a “spin” zero, the angular momentum of the state is 
found to be J = L. Thus the tensor a, corresponds to odd states of a photon 
system with even angular momentum J. 

Let us now consider the symmetric tensor sx. Since this is unaltered when n 
changes sign, it corresponds to even states of the photon system. Hence all the 
components s; can be expressed in terms of spherical harmonic functions of even 
order L (including zero). It is well known that any symmetric tensor s, of rank two 
can be expressed as the sum of a scalar s; and a symmetric tensor s; with zero 
trace (si; = 0). 

The scalar s,; can be assigned a “‘spin’”’ zero, and the angular momentum of the 
corresponding states is therefore J = L, i.e. is even. The tensor sj, has “‘spin’” two 
(see QM, 857). Adding this “‘spin’’ to the even “‘orbital angular momentum” L by 
the law of addition of angular momenta, we find that for a given even J # 0 three 
states are possible (with L = J +2, J), and for odd J# 1 two states (with L = J +1). 
The exceptions are J =0 with one state (L =2) and J = 1 with one state (L = 2). 

In these calculations, however, we have not yet included the condition that the 
tensor s, 1S orthogonal to the vector n. We must therefore subtract, from the 
numbers of states found above, the numbers of states corresponding to a sym- 
metric tensor of rank 2 “parallel” to the vector n. Such a tensor, which we denote 
by si, can be written as 


Sik = nib, + n,bi, 


where b is a certain vector. According to (9.3), this vector must be such that 
b(—n) = —b(n). Thus the tensor sj, which gives the “unwanted”’ states is equivalent 
to an odd vector. The latter must be expressible in terms only of spherical 
harmonics of odd order L. Moreover, the vector has a “‘spin” one, and therefore, 
for any even angular momentum J # 0, two states are possible (with L = J +1), and 
for any odd J one state (with L = J); an exception is J = 0 with one state (L = 1). 

Summarizing the results obtained, we obtain the following table giving the 
numbers of possible even and odd states of a two-photon system (with zero total 
momentum) for various values of the total angular momentum J: 


J even odd 
0 1 1 
1 0 0 (9.5) 
2k 2 l 
2k+1 1 0 


where k is any positive integer (not zero). We see that for odd J there are no odd 
states, and the value J = 1 cannot occur.f 

The wave function A; of the two-photon system determines the correlation 
between the polarizations of the photons. The probability that both photons 


+ Another way of deriving these results is given in 870, Problem 1. 
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simultaneously have definite polarizations e, and e2 is proportional to 
Aine te x. 
Thus, if the polarization e,; of one photon is given, the polarization e, of the other is 
Orn % Aj ii. (9.6) 
In odd states of the system, A, is equal to the antisymmetric tensor a, and 
er-et < axe tek = 0, 


so that the polarizations of the two photons are orthogonal. For linear polarization 
this means that the directions of polarization are perpendicular; for circular 
polarization, that the directions of rotation are opposite. 

An even state with J = 0 corresponds to a symmetric tensor which reduces to a 
scalar, 


Six = constant X (6, — njn,). 


From (9.6), therefore, we have e; = e%. For linear polarization this means that the 
directions of polarization are parallel; for circular polarization, that the directions 
of rotation are again opposite. The latter result is obvious, since when J =0 the 
sum of the components of the photon angular momenta in the same direction k 
must always be zero, because the components in opposite directions k, and ky, 1.e. 
the helicities, are equal. 


CHAPTER II 


BOSONS 


§10. The wave equation for particles with spin zero 


IT HAS been shown in Chapter I how a quantum description of the free elec- 
tromagnetic field can be constructed on the basis of the known properties of the 
field in the classical limit and the concepts of ordinary quantum mechanics. The 
resulting scheme for describing the field as a system of photons contains many 
features which occur also in the relativistic quantum theory description of parti- 
cles. 

The electromagnetic field is a system having an infinite number of degrees of 
freedom. For this system there is no law of conservation of number of particles 
(photons), and its possible states include states with an arbitrary number of 
particles.t In the relativistic theory, systems composed of any particles must in 
general share this property. The conservation of number of particles in the 
non-relativistic theory depends on the law of conservation of mass: the sum of the 
(rest) masses of the particles is unaffected by their interactions, and the constancy 
of the total mass in a system of electrons, say, implies that the number of electrons 
is also unchanged. In relativistic mechanics, however, there is no law of con- 
servation of mass; only the total energy of the system is conserved, which includes 
the rest energy of the particles. The number of particles therefore need not be 
conserved, and consequently every relativistic theory of particles must be a theory 
of systems having an infinite number of degrees of freedom. That is to say, any 
such theory of particles must be a field theory. 

The second quantization formalism (QM, 8864, 65) is a satisfactory means of 
describing systems with a variable number of particles. In the quantum description 
of the electromagnetic field, the second quantization operator is the 4-potential A. 
This is expressed in terms of the (coordinate) wave functions of the individual 
particles (photons) and their creation and annihilation operators. The quantized 
wave function operator has a similar role in the description of a system of particles. 
To derive this operator, we must first know the form of the wave function of a 
single free particle and the equation satisfied by this function. 

The concept of a field of free particles is, it must be emphasized, only an aid to 
the theory. Actual particles interact, and the task of the theory is to consider these 
interactions. But any interaction is equivalent to a collision, before and after which 
the system may be regarded as an ensemble of free particles. It has been remarked 
in §1 that the only measurable objects are of this kind. We therefore use the fields 
of free particles as a means of describing the initial and final states. 


tIn reality, of course, the number of photons changes only as a result of various interaction 
processes. 
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The wave equation must be a differential relationship between the quantities ys 
and w, through the operator p. This relationship must, of course, be given by 
relativistically invariant expressions. Such expressions are 


mis, = Pup, p'y, = my, (10.4) 


where m is a dimensional constant characteristic of the particle.? 
Substituting w, from the first equation (10.4) in the second equation, we obtain 


(p°— m’)p = 0 (10.5) 


(O. Klein, and V. A. Fock, 1926; W. Gordon, 1927). The explicit form of this 
equation is 


— 0,0" = (-% + A)w =m. (10.6) 


Substitution of ys as the plane wave (10.2) gives p?= m7’, from which it is evident 
that m is the mass of the particle. We may note that the form of equation (10.5) is 
in any case obvious a priori, since p’* is the only scalar operator which can be 
derived from p (and, for the same reason, a similar equation is satisfied by every 
component of the wave function of a particle having any spin value, as will be seen 
on several occasions below). 

Thus a particle with spin zero is essentially described by a single (four- 
dimensional) scalar w, which satisfies the second-order equation (10.5). In the 
first-order equations (10.4), the wave function is represented by the set of quan- 
tities & and w,, the 4-vector w, being the 4-gradient of the scalar w. In the rest 
frame, the wave function of the particle is independent of the (space) coordinates, 
and the space components of the 4-vector w, are therefore zero, as they should be. 

In order to continue with the second quantization procedure, it is useful to 
express the energy and momentum of the particle as the space integrals of certain 
combinations bilinear in w and w*, which represent a kind of space density of these 
quantities. We thus have to find an energy-momentum tensor T,, which cor- 
responds to equation (10.5). In terms of this tensor the law of conservation of 
energy and momentum ts expressed by the equation 

d,T¢ =0. (10.7) 

Following the general procedure of field theory (see Fields, 832), we write down 
a variational principle which would lead to equation (10.5). This principle must be 
that the “‘action integral” 


Se | Ld*x (10.8) 


+ The constants m are shown in (10.4) so that yw, and w shall have the same dimensions. There 
would be no point in using different constants m; and m2 in the two equations, since they could always 
be made the same by redefining w or Wy. 
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of some real 4-scalar L, the Lagrangian density of the field,t should take a 
minimum value. Using the scalar w (and the operator 0"), we can construct a real 
bilinear scalar expression of the form 


L = 4, * - a% yp — m’w*y, (10.9) 


where m is a dimensional constant. Regarding W and w* as independent variables 
describing the field (‘generalized field coordinates’? q), we easily see that 
Lagrange’s equation 


dob _ ob 
0x" 0G, Oq 


(10.10) 


(where q_, = 0,q) is in fact the same as the equation (10.5) for w.and w*, m being 
the mass of the particle. The sign of the expression (10.9) has been taken such that 
the square of the time derivative, |dy/dt|’, appears in L with a positive sign; 
otherwise, the action could not take a minimum value (cf. Fields, §27). The choice 
of the numerical factor in L is arbitrary (and affects only the normalization factor 


in ys). 


The energy-momentum tensor can now be calculated from the formula 


aL 


the summation being over all gq. Substitution of (10.9) gives 
T,, = Ouy* + Ob + d,* - d.wW — Lg,y3 (10.12) 


these quantities are real (as they should be), since L is real. In particular, 


feat 5 


ot ot 
_ Op* oy 

Pee Tu Vt mW, (10.13) 
eal ee oe (10.14) 


0 or ax’ ax’ at’ 


The 4-momentum of the field is given by the integral 
P, = T,49 dx, (10.15) 


i.e. To) and To; act as the energy and momentum densities. The quantity To is 
essentially positive. 
+ The corresponding second-quantized operator L is called the Lagrangian of the field. To simplify 


the terminology, we shall use this term for either the “‘quantized” or the “non-quantized” Lagrangian 
density, as convenient. 
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Formula (10.13) can be used for the normalization of the wave function. A plane 
wave, normalized to “one particle in the volume V = 1”, is 


a l —ipx 
Wp = Vie” (10.16) 


since for this function Tp) = ¢, and the total energy in the volume V = 1 is therefore 
equal to the energy of a single particle. 
The angular momentum, whose conservation is due to the isotropy of space, 
can also be expressed as a space integral, but we shall not need this representation. 
There is one further conservation law allowed by equations (10.4) in addition to 
those arising directly from space-time symmetry. It is easily seen that these 
equations and those for w* lead to the equation 


d.J” =0, (10.17) 
where 


ju = Mbp, + WW) 
= iLp* ayy — (dny*)y]. (10.18) 


Thus j” acts as a current density 4-vector, and (10.17) is the equation of continuity 
expressing the law of conservation of the quantity 


Q= | io a's, (10.19) 
where 
cies Oat x IY ay* 
Jo=] i(y of (10.20) 


It should be noted that jo need not be positive. This shows that it cannot in 
general be interpreted as the probability density of spatial localization of the 
particle. The significance of the conservation law expressed by equation (10.17) will 
be shown in §11. 


§11. Particles and antiparticles 


In accordance with the general procedure of the second quantization method, 
we have to consider the expansion of an arbitrary wave function in terms of the 
eigenfunctions of a complete set of possible states of a free particle, for instance in 
plane waves yw: 


b= py Apt, w= >) ays. 


P 
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The coefficients a,,a* are then to be regarded as the annihilation and creation 
operators d,, G, of particles in the corresponding states. 

Here, however, we immediately encounter a difference of principle as compared 
with the non-relativistic theory. In a plane wave which is a solution of equation 
(10.5), the energy e need satsify (for a given momentum p) only the condition 
e° = p’+m’, i.e. it can have two values, +V(p’ +m’). Only positive values of ¢ can 
have the physical significance of the energy of a free particle. But the negative 
values cannot be simply omitted: the general solution of the wave equation can be 
obtained only by superposing all its independent particular solutions. This shows 
that the interpretation of the expansion coefficients for w and w* in the second 
quantization method must be somewhat different. 

We may write the expansion in the form 


a | (+) pi(p:r-et) I (—) pi(p-rtet) 
ve 2 (Qe) °? € so (2e) apn’ & ; (11.1) 


where the first sum contains plane waves with positive “frequency”, normalized 
according to (10.16), and the second sum contains those with negative “frequency”, 
e always denoting the positive quantity +V(p’?+m’). In the second quantization, 
the coefficients a\” in the first sum are replaced as usual by the particle annihilation 
operators d,. In the second sum, we note that, in the subsequent derivation of the 
matrix elements, the time dependence of the terms will correspond to particle 
creation, not annihilation: the factor e' = (e ‘)* corresponds to one extra particle 
with energy « in the final state (cf. the end of §2). Accordingly, the coefficients a” 
are replaced by creation operators bt, relating to other particles. If the summation 
variable p in the second sum in (11.1) is replaced by —p in order to put the 
exponential factor in the form e'®'' ©”, the w-operators are obtained as 


, l A —ipx “+  ipx 
b= D7) ane + bt e'*), 
(11.2) 


7+ — 1 A+ ipx i. = Ipx 
us = 2 dg) (ae & +b, € ). 


Thus all the operators dp, b, are multiplied by functions with the “correct” time 
dependence (~e ““), while the operators d*, b+ are multiplied by the complex 
conjugate functions. This makes it possible to interpret the former operators, in 
accordance with the general rules, as annihilation operators for particles with 
momentum p and energy e«, and the latter as creation operators for these particles. 

In this way we arrive at the concept of particles of two types which occur 
simultaneously and on an equal footing. These are called particles and antiparti- 
cles; the significance of the names will be shown later. One type corresponds to the 
operators Gy, a; in the second quantization formalism, and the other type to b,, be. 
The two types of particle have the same mass, since their operators appear in the 
same w-operator. 


+ The w function is given the 4-momentum p as suffix, since we intend to denote the functions with 
“negative frequency” by w_p. The operators d and da” are given the three-dimensional momentum p as 
suffix, since this entirely defines the state of an actual particle. 
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The reason for these results can also be examined from the point of view of the 
requirements of relativistic invariance. 

The Lorentz transformations are, mathematically, rotations of the four-dimen- 
sional coordinate system which change the direction of the time axis; together with 
the purely spatial rotations which do not affect the time axis, they form the Lorentz 
group of transformations.* All the Lorentz transformations have the property that 
they leave the t axis within the corresponding light cone, and this expresses the 
physical principle that there exists a maximum possible velocity of propagation of 
signals. 

In a purely mathematical sense, the simultaneous change of sign of all four 
coordinates (four-dimensional inversion) is also a rotation, since the determinant of 
this transformation is +1, like that of any rotational transformation. The time axis 
is thereby carried from one light cone to the other. Although this means that sucha 
transformation is physically impossible (as a transformation of the frame of 
reference), the only difference mathematically is that, because the metric is 
pseudo-Euclidean, such a rotation cannot be effected continuously without allow- 
ing also a complex transformation of the coordinates. 

It is reasonable to suppose that this difference is unimportant in relation to 
four-dimensional invariance. Then any expression which is invariant under the 
Lorentz transformations must be invariant under 4-inversion also. A_ precise 
statement of this condition as applied to the scalar w-operator will be given in $13, 
but here it may be noted that the condition will certainly make necessary the 
simultaneous presence in the w-operators of terms having both signs of ¢ in the 
exponents, since this sign is changed by the substitution t > — t. 

Let us return now to equations (11.2) and derive the commutation relations 
between the operators d,, a; (and b,, b;). For photons (the operators ¢,, ¢}), this was 
done on the basis of the analogy with oscillators, that is, essentially from the 
properties of the electromagnetic field in the classical limit. Here there is no such 
analogy. In deriving the (Bose or Fermi) commutation rules between the operators, we 
can be guided only by the form of the Hamiltonian constructed from these operators. 

This Hamiltonian is obtained (see QM, §64) by substituting & and /* in place of 
w and w* in the integral f Ty d’x.t We then find 


A 


H = 2, &(4 a, + b,b}). (11.3) 


It is easily seen that a reasonable result is obtained for the eigenvalues of this 
Hamiltonian only if the operators satisfy the Bose commutation rules: 


{Gp, G3}- = {bp, bp}- = 1 (11.4) 


+ The set of all three-dimensional (spatial) rotations is itself a group, which constitutes a subgroup of 
the Lorentz group. The set of the Lorentz transformations is not itself a group, since the result of 
successive Lorentz transformations may be a purely spatial | rotation. 

¢ In the non-relativistic theory, the conjugate operator pt is by convention written to the left of w. 
Here, the order is of no importance, since the interchange of s and &* would cause only the interchange 
of the equivalent operators a, and b». However, once a particular order has been selected, the same 
order must be used throughout. 
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(all other pairs of operators commute, including each particle operator 4,, 4G; with 
each antiparticle operator b,, b>). For, in this case, 


H =>) e(4}4,+ bib, + 1). 


P 
The eigenvalues of the products d}4, and b;b, are positive integers N, and N,, the 


numbers of particles and antiparticles. The infinite additive constant =e (the 
“energy of the vacuum’’) may again be simply omitted: 


E => e(Npt+Ny,); (11.5) 


cf. formula (3.1) and the footnote to it. This expression is essentially positive, and 
corresponds to the idea of two types of actual particles. Similarly, we have for the 
total momentum of the system 


P= > p(N,+N,). (11.6) 


If, instead of (11.4), we used the Fermi commutation rules (anticommutators 
instead of commutators), we should obtain 


H =>, c(aj4, — b3b, + 1), 
P 


and instead of (11.5) the physically meaningless expression = e(N, — N,), which is 
not positive-definite and hence cannot represent the energy of a system of free 
particles. 

Particles with spin zero are therefore bosons. 

Next, let us consider the integral Q (10.19). Replacing the functions w& and w* in 
j° by the operators & and W*, and carrying out the integration, we obtain 


O = > (G34, — bb }) = > (434, — bFb, - 1). (11.7) 
p p 


The eigenvalues of this operator are (omitting the unimportant additive constant 
> 1) 


Q => (N,— Np) (11.8) 


and are therefore equal to the differences between the total numbers of particles 
and antiparticles. 

So long as we are discussing free particles and ignoring any interaction between 
them, the law of conservation of the quantity Q is, of course, largely conventional 
(like those of total energy (11.5) and total momentum (11.6)): what is actually 
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conserved is not only the sum Q but the numbers N,, N p individually. The nature 
of the interaction decides whether the quantity Q is conserved. If Q is conserved 
(i.e. if the operator Q commutes with the Hamiltonian of the interaction), the 
formula (11.8) shows the limitation imposed by the conservation law on the 
possible variation of the number of particles: only “‘particle—antiparticle”’ pairs can 
be formed or disappear. 

If a particle is electrically charged, its antiparticle must have a charge of the 
opposite sign: if both had charges of the same sign, the creation or annihilation of 
the particle—antiparticle pair would contravene a rigorous law of nature, the 
conservation of total electric charge. We shall see later ($32) how the theory 
automatically leads to this oppositeness of the charges (for interactions of particles 
with an electromagnetic field). 

The quantity Q is sometimes called the charge of the field of the particles 
concerned. For electrically charged particles Q gives, in particular, the total 
electric charge of the system in terms of the unit charge e. But particles and 
antiparticles may also be electrically neutral. 

Thus we see that the nature of the relativistic relation between the energy and 
the momentum (the twofold root of the equation e* = p’+ m’), together with the 
requirements of relativistic invariance, leads in the quantum theory to a new 
principle of classification of particles: there can exist pairs of different particles 
(particle and antiparticle) which are interrelated in the way described above. This 
remarkable prediction was first made (for particles with spin 4) by Dirac in 1930, 
before the discovery of the first antiparticle, the positron.? 


§12. Strictly neutral particles 

In the second quantization of the w-function (11.1), the coefficients a” and at” 
were treated as operators relating to different particles. This is not necessary, 
however: as a particular case, the annihilation and creation operators in & may 
relate to the same particles, as for photons (cf. (2.17)). Then, denoting these 
operators by ¢, and ¢;, we write the W-operator as 


fe l A —ipx a+ ipx 
W = d 75 (Cre a Ones dae CZs) 


The field described by this operator corresponds to a system of particles of one 
kind only, which may be said to be their own antiparticles. 

The operator (12.1) is Hermitian (* = &), and in this sense such a field has only 
half as many “‘degrees of freedom” as a complex field for which the operators WJ 
and w* are not the same. 

In consequence, the field Lagrangian, expressed in terms of the Hermitian 
operator wW, must contain a further factor 3 in comparison with (10.9):¢ 


L = (0, - 0% — mW’). (12.2) 


+ The antiparticle concept was extended to bosons by V. Weisskopf and W. Pauli (1934). 

+ This resembles the extra factor 3 in the operator (2.10) of the electromagnetic field energy density 
(when the field is expressed in terms of the Hermitian operators E and H), in comparison with the 
photon energy density (3.2) expressed in terms of the complex wave function; cf. the last footnote to §3. 
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The corresponding energy-momentum tensor is 
T,,, = d,W - dup — Le,., (12.3) 
and hence the energy density operator is 


Ts = (aw/aty = EL 
= (aplatyY + (Vp) + mp’). (12.4) 


Substituting (12.1) in the integral { Tyo d*x, we obtain the field Hamiltonian: 


H =3>, e(€3¢,+ 6, ¢9). (12.5) 
p 


(py Care = 4, (12.6) 


and the energy eigenvalues (again without the additive constant) are 
E =)! «,N,. (12.7) 
Pp 


Fermi quantization would lead to the absurd result that E is independent of N,. 

The “‘charge’”’ Q of this field is zero, as is evident from the fact that Q must 
change sign when particles are replaced by antiparticles, whereas in the present 
case there is no difference between the two. The current density 4-vector therefore 
does not exist, since the expression 


jn = if uh — (nb pb] (12.8) 


for the operator j of the conserved 4-vector is zero when & = W* (the vector sd, 
is not itself conserved). This, in turn, means that there is no special conservation 
law restricting the possible changes in the number of particles. Such particles must 
clearly be electrically neutral. 

Particles of this kind are said to be strictly neutral, as opposed to electrically 
neutral particles which are not their own antiparticles. Whereas the latter can be 
annihilated (transformed into photons) only as pairs, strictly neutral particles can 
be annihilated singly. 

The structure of the w-operator (12.1) is similar to that of the electromagnetic 
field operators (2.17)-(2.20). In this sense we may say that photons are themselves 
strictly neutral particles. For the electromagnetic field, the operators are Hermitian 
because the fields are measurable physical quantities (in the classical limit) and are 
therefore real. For the w-operators of particles there is no such relation, since they 
do not correspond to any quantities that are directly measurable. 

The absence of a conserved current 4-vector is a general property of strictly 
neutral particles, and does not require the spin to be zero; for instance, it occurs 
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for photons also. Physically, it expresses the absence of the corresponding pro- 
hibitions on a change in the number of particles. There is a direct formal relation 
between the absence of a conserved current and the fact that the field is real (the 
operator & is Hermitian). 

The Lagrangian of a complex field, 


L = d,s - a" — met, (12.9) 


is invariant under multiplication of the w-operator by any phase factor, i.e. under 
the gauge transformations 


aA 


poe pi oe yp. (12.10) 


In particular, the Lagrangian is unchanged under the infinitesimal transformation 


A aA A aA 


bouwtisa- th, bod —ida- we. (12.11) 


When the “generalized coordinates” q undergo an infinitesimal change, the 
change in the Lagrangian is 


) Oe ee) a (aL 
~ (oa ax ae) 84+ Dan Ga 5a) 


(with summation over all q). The first term is zero, from the “equations of motion” 
(Lagrange’s equations). If the ‘“‘coordinates”’ q are taken to be the operators w and 
Ww’, and with 6W =ida-: wu, ds" =—ida-: wb", we obtain 


ee re ee) eee a) 
BL = ida 5e7 (bag wy ae) 


Hence we see that the condition for the Lagrangian to be invariant (SL = 0) is 
equivalent to the equation of continuity (d,j" = 0) for the 4-vector 


paige a= — =} (12.12) 
OW OW 


It is easily shown that, with the Lagrangian (12.9), this formula yields the current 
(12.8). 

Thus, in the mathematical formalism of the theory, the existence of a conserved 
current is related to the invariance of the Lagrangian under the gauge trans- 
formations (W. Pauli, 1941). The Lagrangian (12.2) of the strictly neutral field does 
not possess this symmetry. 


44 Bosons $13 


§13. The transformations C, P and T 


Unlike 4-inversion, three-dimensional (spatial) inversion is not reducible to any 
rotations of the 4-coordinate system; its determinant is —1, not +1. The symmetry 
properties of particles with respect to inversion (the P transformation) are there- 
fore not determined already by considerations of relativistic invariance.t 

The inversion operation, as applied to a scalar wave function, is the trans- 
formation 


Pw(t, r) = + w(t, -n), (13.1) 


where the plus and minus signs on the right correspond to true scalars and 
pseudoscalars respectively. 

Hence we see that two features of the behaviour of the wave function under 
inversion must be distinguished. One of these relates to the coordinate dependence 
of the wave function. In non-relativistic quantum mechanics, only this aspect was 
considered; it leads to the concept of the parity of the state (which we shall here 
call the orbital parity), describing the symmetry properties of the motion of the 
particle. If the state has a definite orbital parity (+1 or —1), this means that 


w(t, —r) = + (tr). 


The other feature is the behaviour of the wave function at a given point (which 
may conveniently be taken as the origin) under inversion of the coordinate axes. 
This leads to the concept of the internal parity of the particle. The two signs in 
(13.1) correspond to internal parity +1 and —1 (for a particle with spin zero). The 
total parity of a system of particles is given by the product of their internal parities 
and the orbital parity of their relative motion. 

The “internal”? symmetry properties of various particles appear, of course, only 
in their mutual transformation processes. In non-relativistic quantum mechanics, 
the analogue of the internal parity is the parity of a bound state of a composite 
system, such as a nucleus. In the relativistic theory, which makes no essential 
distinction between composite and elementary particles, this internal parity is no 
different from the internal parity of those particles which are regarded as elemen- 
tary in the non-relativistic theory. In the non-relativistic case, where these particles 
are regarded as unalterable, their internal symmetry properties are not observable, 
and a discussion of these would therefore be devoid of physical significance. 

In the second quantization formalism, the internal parity is expressed by the 
behaviour of the w-operators under inversion. Scalar and pseudoscalar fields 
correspond to the transformation laws 


P: #(t,r) > + W(t, 1). (1332) 


The actual significance of the action of inversion on the w-operator must be 


+ The Lorentz group together with spatial inversion is called the extended Lorentz group (in contrast 
to the original group without P, which in this connection is called the proper Lorentz group). The 
extended group includes all transformations which leave the t axis within the corresponding light cone. 
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formulated as a particular transformation of the particle annihilation and creation 
operators, such as to lead to the result (13.2). It is easily seen that such trans- 
formations are 


Fas 


P:4>+4,, b,>+b, (13.3) 


(and the same for the conjugate operators). For, on making these changes in the 
operator 


r i A —iwt+ip-r e+ iwt—ip-r 
W(t 0) = 2 7G5y (dee ES spee oth?) (13.4) 


and then changing the notation for the summation variable (p> — p), we can bring it 
to the form +¢(t, —r). Thus, if w(t, r) denotes the operator after the substitutions 
(13.3), we have 


w?(t,r) = + b(t, —r). (13.5) 


The transformation (13.3) is entirely reasonable, since inversion changes the sign of 
the polar vector p, and particles with momentum p are therefore replaced by 
particles with momentum —p. 

In (13.3) the operators dG, and bp are transformed either both with the upper sign 
or both with the lower sign. In the second quantization formalism, this expresses 
the fact that particles and antiparticles (with spin zero) have the same internal 
parity, a result which is evident because they are described by the same (scalar or 
pseudoscalar) wave functions. 

The w-operator (13.4) is also symmetrical under a transformation which has no 
analogue in the non-relativistic theory, that of charge conjugation (the C trans- 
formation). If all the operators a, and b, are respectively interchanged: 


A 


C:a,>b,, b> 4, (13.6) 


(i.e. if particles and antiparticles are interchanged), then i becomes the charge- 
conjugate operator #°, where 


b(t, r) = b(t, r). (13.7) 


This equation expresses the symmetry of the concepts of particles and antiparticles 
in the theory. 

There is an unimportant formal arbitrariness in the definition of the charge- 
conjugation transformation. The significance of the transformation is unchanged if 
an arbitrary phase factor is included in the definition (13.6): 


This would lead to 


QE4-E 
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and a twofold repetition of the transformation would again yield an identity 
(sw). All such definitions are equivalent, however. Since the properties of the 
w-operators are unchanged on multiplication by a phase factor (cf. the end of $12), 
we can simply write ye? in place of yw, and thus again obtain the definition of 
charge conjugation (13.6), (13.7). 

Since charge conjugation replaces a particle by its antiparticle, which is not 
identical with it, no new properties of a particle or a system of particles, as such, 
will in general arise. 

An exception is formed by systems comprising equal numbers of particles and 
antiparticles. The operator C transforms such a system into itself, and so in this 
case the operator has eigenstates, corresponding toethe eigenvalues C = +1 (since 
C*=1). To describe the charge symmetry, we may regard the particle and the 
antiparticle as two different “‘charge states’’ of the same particle, differing in the 
value of the charge quantum number Q = +1. The wave function of the system is 
the product of an orbital function and a “charge” function, and must be sym- 
metrical with respect to simultaneous interchange of all the variables (coordinate 
and charge) of any pair of particles. The symmetry of the “charge” function 
determines the charge parity of the system (see the Problem at the end of this 
section). 

The concept of charge parity, which arises in a natural manner for “strictly 
neutral’’ systems, must apply also to strictly neutral ““elementary”’ particles. In the 
second quantization formalism, this concept is represented by the equation 


aA 


io = + db, (13.8) 


where the plus and minus signs correspond to charge-even and charge-odd particles 
respectively. 

Relativistic invariance implies invariance under 4-inversion (see 811). For a 
scalar field operator (in the sense of 4-rotations) this means that 4-inversion must 
give 


b(t, r) > b(-t, —r) 


with the right-hand side always positive. In terms of transformations of the 
operators d,, b,, the transformation of w(t,r) into y(—t, —-r) is obtained by inter- 
changing the coefficients of e ”* and e”* in (13.4), 1.e. by making 


A 


Gy>b;, bat. (13.9) 


Since a-operators are replaced by b-operators, this involves interchange of parti- 
cles and antiparticles. We see that, in the relativistic theory, there is a natural 
requirement of invariance under a transformation in which spatial inversion (P) 
and time reversal (T) are accompanied by charge conjugation (C); this is called the 
CPT theorem.t 


+ In this discussion we are considering a particle with spin zero. The treatment given here can be 
immediately generalized to other spin values; see, for instance, §27, Problem. 
+ This theorem was enunciated by G. Liiders (1954) and W. Pauli (1955). 
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Here, however, it must be emphasized that, although the arguments given in 
§$11 and 12 and the present section are a natural development of the ideas of 
ordinary quantum mechanics and classical relativity theory, the results thus 
obtained go beyond these both in form (wW-operators including both particle creation 
and particle annihilation operators at the same time) and in content (particles and 
antiparticles). They cannot therefore be regarded as logically necessary, but 
embrace new physical principles whose correctness can be tested only by experi- 
ment. 

If the operator (13.4) transformed by (13.9) is denoted by 7 (t,r), we can 
write 


pT (tr) = b(-t, —r). (13.10) 


Thus, if 4-inversion is formulated as the transformation (13.9), we thereby 
establish also the formulation of the time-reversal transformation of the w-opera- 
tor: together with the combined inversion transformation CP, it must give 
(13.9). Using the definitions (13.3) and (13.6), we therefore find 


A 


T:4,>+4*,, b,>+5t,, (13.11) 


where the signs + correspond to those in (13.3). The significance of this trans- 
formation is obvious: time reversal not only changes motion with momentum p into 
motion with momentum —p, but also interchanges initial and final states in the 
matrix elements. The annihilation operators for particles with momentum p are 
therefore replaced by creation operators for particles with momentum —p. Making 
the substitutions (13.11) in (13.4) and changing the notation for the summation 
variable (p> — p), we obtain? 


i" (t.r)=+b'(-t,r). (13.12) 


This is similar to the general rule for time reversal in quantum mechanics: if a 
certain state is described by the wave function wW(t,r), then the “time-reversed”’ 
state is described by the function w*(—t,r). The change to the complex conjugate 
function is necessary because the “correct”? time dependence must be restored, 
after being lost through the change in the sign of t (E. P. Wigner, 1932). 

Since the transformation T (and therefore CPT) interchanges the initial and 
final states, there are no eigenstates and eigenvalues, and therefore no new 
properties of particles as such. The consequences as regards scattering processes 
will be discussed in §869 and 71. 

Let us see how the current 4-vector operator j“ (12.8) is affected by the 
transformations C, P and T. The transformation (13.2), together with (do, 0:)—> 
(do, —0;), gives 


Pi Dier> P —Di-v (13.13) 


+ If the operation T is defined without regard to the other transformations, there is the same 
arbitrariness in the choice of the phase factor as occurs for the operation C. The requirement of CPT 
symmetry implies that the phase factor can be chosen arbitrarily for only one of the transformations C 
and T. 
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as we should expect for a true 4-vector. The transformation (13.7) would give simply 
i) ee ae tes) (13.14) 


if the operators & and &* commuted. However, the non-commutativity of these 
operators is due only to that of the operators d, and @{ (or b, and b;) with the 
same p, and from the commutation rules (11.4) the interchange of these operators 
produces only terms independent of the occupation numbers, i.e. independent of 
the state of the field. Omitting these terms as unimportant, as in (11.5), (11.6), we 
return to (13.14), whose significance is evident: charge conjugation replaces parti- 
cles by antiparticles and thus changes the sign of every component of the 4-current. 

Since the operation of time reversal involves transposing the initial and final 
states, it changes the order of the factors in a product of operators. For example, 


(ht duh)? = (Op) (WY. 


Here, however, this is not important: since the w-operators commute (in the sense 
explained above), the result is unaffected by returning to the original order of 
factors. Since also (do, 0;) >(—4o, 0;) under time reversal, the current transformation 
rule is 


FE De Gar: (13.15) 
The three-dimensional vector j changes sign, in accordance with its classical 
significance. 
Finally, for the CPT transformation, 


CPT: (7°, er > (-J°, ~D-1-+ (13.16) 


in accordance with the significance of this operation as 4-inversion. Here it must be 
emphasized that, since 4-inversion is a rotation of the 4-coordinate system, it does 
not correspond to two types (true and pseudo) of 4-tensors of any rank. 

So far, we have assumed that the particles are free; but parity quantum 
numbers acquire real significance only when interacting particles are considered 
and definite selection rules are imposed which allow or forbid specified processes. 
Only conserved properties, however, can have this significance; that is, the eigen- 
values of operators which commute with the Hamiltonian of the interacting 
particles. 

Because of relativistic invariance, the CPT transformation operator always 
commutes with the Hamiltonian. For the C and P (and therefore T) trans- 
formations separately, experiment shows that the electromagnetic and strong 
interactions are invariant, and the corresponding parity quantum numbers are 
therefore conserved in these interactions. In a weak interaction, these conservation 
laws do not hold.t 


+ The idea that parity might not be conserved in weak interactions was first put forward by T. D. Lee 
and C. N. Yang (1956). The general notion that the laws of physics might not have P and T invariance 
had previously been suggested by Dirac (1949). 
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Anticipating a little, we may mention that the operator of the interaction 
between charged particles and the electromagnetic field is given by the product of 
the operator 4-vectors A and j. Since charge conjugation changes the sign of j, the 
invariance of the electromagnetic interaction under this transformation means that 
the sign of A must also be changed. Thus photons are charge-odd particles. 

This behaviour of the operators A is in accordance with the properties of the 
4-pnotential in the classical theory: from the transformations 


C: (Ao, A)>(—Ao, —A):, 
P: (Ap, A) ae (Ao, —A); —Io 
CPT: (Ao, A) > (— Ao, —A)-1,-+, 


it follows that 
T: (Ao, A) aes (Ao, —A)-11, 


in agreement with the classical rule for the transformation of the electromagnetic 
field potentials under time reversal. 

The requirement of CPT invariance does not impose any limitations on the 
properties of the particles themselves, but it implies certain relations between those 
of particles and antiparticles. Firstly, their masses must be equal, as is evident from 
the relation described in §11 between 4-inversion and the basis of the concept of 
particles and antiparticles. Next, it follows from CPT invariance that there is only a 
difference of signin the proportionality coefficients between the electric and magnetic 
moment vectors and the particle and antiparticle spin vector. The magnetic moment 
changes sign under the C and T transformations but (being an axial vector) is not 
affected by the P transformation. Hence the CPT transformation, which converts a 
particle into an antiparticle, does not change the sign of the magnetic moment; the spin 
vector does change sign. The same applies to the electric moment, which is unchanged 
by time reversal but changes sign under the C transformation and (being a polar 
vector) under spatial inversion. 

The requirements of P and T invariance (if complied with) restrict the proper- 
ties of each particle, prohibiting the existence of an electric dipole moment: the 
only vector that can be constructed from the w-operators of an elementary particle 
at rest is its spin operator vector, which is P-even and T-odd, and can therefore give 
rise to a magnetic moment but not an electric moment. We must emphasize that 
either P invariance or T invariance is sufficient to invoke this prohibition. 


PROBLEM 


Determine the charge and spatial parities of a system of two particles with spin zero (particle and 
antiparticle) and orbital angular momentum of relative motion l. 

SOLUTION. Interchanging the coordinates of the particles is equivalent to inversion (about their 
centre of mass), and therefore multiplies the orbital function by (—1)'; interchanging the charge variables 
is equivalent to charge conjugation, and multiplies the “charge” factor in the wave function by the 
required parity C. The condition C(—1)' = 1 gives 


Cc =(-1). 
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The spatial parity P of the system is the product of the orbital parity and the internal parities of the two 
particles. Since the particle and the antiparticle have the same internal parity, in this case P is equal to 
the orbital parity: 


P =(-ly. 


§14. The wave equation for a particle with spin one 


A particle with spin one is described in its rest frame by a three-component 
wave function, a three-dimensional vector; such a particle is often called a vector 
particle. The four-dimensional origin of this vector may be as the three spatial 
components of the space-like 4-vector Ww” or the mixed components of the 
antisymmetric 4-tensor w’” of rank two; the time component w° and the space 
components ys“ are zero in the rest frame.t 

The wave equation is a differential relation between the quantities yw" and ys’, 
and will be written as the equations 


Wyv = Pu, =2 Pith, (14.1) 
im*by. = War (14.2) 


with p =i0 (A. Proca, 1936). Applying the operator p” to both sides of equation 
(14.2), we have 


p's, = 9, (14.3) 


since W,, ts antisymmetric. 
By substituting (14.1) in (14.2) to eliminate y,,, and using (14.3), we obtain 


(p’—m*)h, = 0, (14.4) 


whence it is again evident (cf. §10) that m is the mass of the particle. Thus a free 
particle with spin one can be described by a single 4-vector w", whose components 
satisfy the second-order equation (14.4), and also the further condition (14.3), 
which eliminates from w" the part pertaining to spin zero. 

In the rest frame, where wW, is independent of the spatial coordinates, we find 
that p°w =0- Since also pow = mu, it is seen that in the rest frame = 0, as it 
should be, and the w are likewise zero. 

A particle with spin one can have different internal parities, according as w" is a 
true vector or a pseudovector. In the former case 


Py = (f°, -'), 
and in the latter case 
Py = (-y°, w'). 


+ Anticipating, we may mention that the ensemble of the 4-vector w, and the 4-tensor yw” 
corresponds to that of the 4-dimensional spinors of rank two &°°, nag, ¢°°, where €° and nag are 
symmetrical spinors changed into each other on inversion (819). 
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Equations (14.1), (14.2) can be derived from the variational principle, using the 
Lagrangian 


L= ahr _ ays" * (dup, _ dW, ) _ ays” (db _ d,s) a mw *, (14.5) 


The independent generalized coordinates are here represented by w,, w*, wW,,, w%,.T 

To find the energy-momentum tensor, formula (10.11) is not entirely suitable 
here, since it would lead to an unsymmetrical tensor requiring further sym- 
metrization. Instead, we can use the formula 


6 a(LV=g) , LV~8) 


1 ee, 
7A Ora & ax* ag’, agh” ) 


(14.6) 


in which L is assumed to be expressed in a form appropriate to any curvilinear 
coordinates (see Fields, $94). If L contains only the components of the metric 
tensor g,,, and not their derivatives with respect to the coordinates, the formula 
becomes simply 


(since d log g = — g,,dg"”). 

Since the differentiation in formula (14.6) is not with respect to the quantities 
wW., W,,, these quantities need not be regarded as independent when applying the 
formula; we may immediately make use of the relationship (14.1) to rewrite the 
Lagrangian (14.5) as 

L = —4t,,.0*,2"9'? + mb,w*e"”. (14.7) 
Then 
dif aaa Wirt>* — Wy + mbes, a wu, ) 5 Sur (Prot? * = mptyp*). (14.8) 
In particular, the energy density is given by the essentially positive expression 


Too = sine + Woh + m° (poss + hap). (14.9) 


The conserved current density 4-vector is given by 


JP = ip, — wh). (14.10) 
This can be obtained, in accordance with (12.12), by differentiating the Lagrangian 


+ If the variation were made with respect to ws, only (assuming w,, already expressed in terms of ys, 
by (14.1)), equation (14.3) would have to be imposed as an additional condition unrelated to the 
variational principle. 
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(14.5) with respect to the derivative 0,,. In particular 
P= ie — WWD) (14.11) 


and is not an essentially positive quantity. 
A plane wave normalized to one particle in the volume V = 1 is 


= ! —ipx a 
Ye = (dey HH? ; ia = =], (14.12) 


where u, is the unit polarization 4-vector, which, by (14.3), satisfies the condition 
of four-dimensional transversality, 


u,p" = 0. (14.13) 
For, on substituting the function (14.12) in (14.9) and (14.11), we obtain 
To = —2e*,yp"* = e, ya. 
Unlike the photon, a vector particle with non-zero mass has three independent 
directions of polarization. The corresponding amplitudes are given in (16.21). 


The density matrix for partially polarized vector particles is defined so that in a 
pure state it reduces to the product 


Pur = UU? 


(similarly to (8.7) for photons). According to (14.12) and (14.13), it satisfies the 
conditions 


Dp, =0, ph=—L. (14.14) 


For unpolarized particles, p,, must have the form ag,,+bp,p, When the 
coefficients a and b are found from (14.14), the result 1s 


Puv = — 3(8u» — Pup’). (14.15) 


The quantization of the vector particle field is entirely analogous to the scalar 
case, and there is no need to repeat the arguments. The wW-operators of the vector 
field are 


“a 1 A a —ipx a a ipx 
Yn = 2 yp) Goan? € B + by, ™ e'?*), 
p, a 
(14.16) 


. 1. ies | 
b= > Ve) (a, eb, e?), 
p, @ 


where the suffix a labels the three independent polarizations. 
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As in the scalar case, Bose quantization is necessary because the expression 
(14.9) for To is positive definite and the expression (14.11) for j° is not. 

There is a close connection between the properties of strictly neutral vector and 
electromagnetic fields. The neutral vector field is described by an Hermitian 
w-operator: 


bh => TOs) (Co eS Cau er), (14.17) 
p.a@ 


The Lagrangian of this field is 


aA A 


L = gdh” — 2h" (auth, — yp.) tam bp". (14.18) 


The electromagnetic field corresponds to m = 0. The 4-vector ys" then becomes 
the 4-potential A”, and the 4-tensor w"*” becomes the field tensor F"’, which is 
related to the potential by the definition (14.1). Equation (14.2) becomes 0’y,, = 0, 
corresponding to the second pair of Maxwell’s equations. This does not imply the 
condition (14.3), which therefore 1s no longer obligatory. Since the extra condition 
has disappeared, there is no need to regard wh, and Wav as independent “‘coor- 
dinates” in the Lagrangian, and (14.18) becomes 


= lige (14.19) 


in agreement with the familiar classical expression for the Lagrangian of the 
electromagnetic field. This Lagrangian, like the tensor urs is invariant under any 
gauge transformation of the “potentials” y,. There is an evident connection 
between this property and the zero mass: the Lagrangian (14.18) does not possess 
the property, because of the term m7". 


815. The wave equation for particles with higher integral spins 


Since the wave equations (14.3), (14.4) follow immediately when the particle 
mass and spin are given, the practical utilization of the Lagrangian involves not so 
much the derivation of these equations as the establishment of expressions for the 
field energy, momentum and charge. 

To do so we can, as already mentioned, use in place of (14.5) the expression 
(14.7), and the latter can be further transformed as follows. From (14.1), it can be 
rewritten as 


L = — (A.W )(a"p”) + (a,We(Oep’) + mp pe* 
=~ (Buh VOMw’) + mrp” + a, (WEap’) — PEaranh”. 


The last term is zero, by (14.3), and the one preceding it is a total derivative. 
Omitting this, we obtain the Lagrangian 


L’ = — (dh *)(0" Wh) + mye". (15.1) 
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This has the same form as the Lagrangian (10.9) for a particle with spin zero, the 
only difference being that the scalar w& 1s replaced by the 4-vector ws, and the sign is 
changed. The change of sign occurs because y&, 1s a space-like vector, so that 
wiw"* <0, whereas for a scalar particle ww* > 0. 

On constructing the energy-momentum 4-tensor and the current 4-vector from 
the Lagrangian (15.1), we obtain expressions of the same form as (10.12) and 
(10.18) for the scalar field: 


Tips alt OO * Oh L gia (15.2) 

ju = — WTO  — CO. DY]. (15.3) 

Thd difference between these and (14.8), (14.10) is again a total derivative. But it 

has already been stressed that the local values of these quantities have no profound 

physical significance. Only the volume integrals P, (10.15) and Q (10.19) are 
important, and these will be the same for either choice of T,,, and j,. 

This method of description can be immediately generalized to particles with any 

(integral) spin. The wave function of a particle with spin s is an irreducible 4-tensor 


of rank s, i.e. a tensor symmetrical in all its indices and vanishing on contraction 
with respect to any pair of indices: 


ee ie | ee aay ae ue,” =0. (15.4) 

This tensor must satisfy the additional condition of 4-transversality: 
Pb... =0, (15.5) 

and each of its components must satisfy the second-order equation 
(p°— m*)p..=0. (15.6) 
In the rest frame, the condition (15.5) means that every component of the 4-tensor 
whose indices include a zero must vanish. Thus the wave function in the rest frame 
(i.e. in the non-relativistic limit) is equivalent, as it should be, to an irreducible 

3-tensor of rank s, the number of independent components of which is 2s + 1. 

The Lagrangian, the energy-momentum tensor and the current vector for a field 


of particles with spin s differ from (15.1)-(15.3) only in that ys, is replaced by a,.... 
The normalized plane wave is 


1 - 
BV... piv... Ipx * Vee 
ws V(2e) u Cen, ux, ue 1, (15.7) 


the wave amplitude satisfying the conditions 
u-’p, =0. (15.8) 


There are 2s + 1 independent polarization states. 
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The quantization of the field is effected by an obvious generalization from the 
cases of spin zero and one. 

The procedure given above is entirely sufficient for the stated purpose: to 
describe a field of free particles. The situation is different if it is proposed to 
describe the interaction of the particles with an electromagnetic field. This inter- 
action would have to be included in the Lagrangian in order to yield all the 
equations without the need to impose additional conditions. In practice, however, 
this description of the interaction is found to be applicable only for electrons, 1.e. 
particles with spin 3 (see §32). For other spin values, therefore, the problem is only 
of methodological interest. 

For any spin s > 1 (integral or half-integral), 1t proves impossible to formulate a 
variational principle by means of a single (tensor or spinor) function whose rank 
corresponds to the given spin. It is necessary to use additional tensor or spinor 
quantities of lower rank. The Lagrangian is then so chosen that these auxiliary 
quantities must be zero on account of the free-particle field equations which follow 
from the variational principle.t 


$16. Helicity states of a particlet 


In the relativistic theory the orbital angular momentum | and the spin s of a 
moving particle are not separately conserved. Only the total angular momentum 
j —1+s 1s conserved. The component of the spin in any fixed direction (taken as the 
z-axis) is therefore also not conserved, and cannot be used to enumerate the 
polarization (spin) states of the moving particle. 

The component of the spin in the direction of the momentum is conserved, 
however: since l=rXp the product s-n is equal to the conserved product 
j-n (n= p/|p|). This quantity is called the helicity; it has already been mentioned in 
§8 in relation to the photon. Its eigenvalues will be denoted by A (A = —S,..., +8), 
and states of a particle having definite values of A will be called helicity states. 

Let y,, be the wave function (plane wave) describing the state of a particle with 
definite values of p and A, and u™(p) its amplitude; to simplify the notation, we 
shall omit the indices for the components of this function (4-tensor indices for a 
particle with integral spin). 

It has been shown in earlier sections that a wave function with more than 2s + 1 
components is needed in order to give a relativistic description of particles with 
non-zero (integral) spin. But the number of independent components remains equal 
to 2s + 1; the ‘“‘extra’’ components are eliminated by imposing additional conditions 
which cause these components to vanish in the rest frame. In Chapter III this will 
be shown for half-integral s also. 

According to the formulae for transformation of the angular momentum (see 
Fields, $14), the helicity is invariant under those Lorentz transformations which do 
not alter the direction of p along which the angular momentum component Is taken. 
The number A therefore remains a good quantum number under such 


+ See M. Fierz and W. Pauli, Proceedings of the Royal Society A173, 211, 1939. The procedure indicated 
above is carried out in this paper for particles with spin 3/2 and 2. 
+ The discussion in this section relates to particles with any spin (integral or half-integral). 
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transformations, and the symmetry properties of helicity states can be studied by 
means of a frame of reference in which the momentum |p| < m (in the limit, the rest 
frame). Then y,, reduces to a non-relativistic wave function with 2s + 1 components. 
Let its amplitude be denoted by wn), the argument being the direction n= p/|p| 
along which the angular momentum is quantized. The amplitude w™ is an eigen- 
function of the operator n° $8: 


(n> §)w'(n) = AW (n). (16.1) 


In the spinor representation, w“ is a contravariant symmetrical spinor of rank 2s; 
according to the correspondence formulae (QM, (57.2)), its components can also be 
enumerated by the corresponding values of the spin component o along a fixed 
Z-axis.t 

In the momentum representation, the wave functions of the states considered 
are essentially the same as the amplitudes u“(p): 


Vor (k) = uw) Cw = n) = w(p)5(w = n), (16.2) 


where the momentum as an independent variable is denoted by k, as contrasted 
with its eigenvalue p, and v = k/|k|, as against n = p/|p|.¢ In the non-relativistic limit, 


Wav) = wv) bO(v — n) = w(n)5 (vy — n). (16.3) 
This expression should be written in the more explicit form 
Waly, o) = wP(v)dC(v — n), 


showing the discrete independent variable o. 

The helicity operator §-n commutes with the operators j, and j?, since the 
angular momentum operator is related to an infinitesimal rotation of the coor- 
dinates, and the scalar product of two vectors is invariant under any rotation. 
There exist, therefore, stationary states in which the particle simultaneously has 
definite values of the angular momentum j, its component Jj, = m, and the helicity A. 
Such states will be called spherical helicity states. 

Let us determine the wave functions of these states in the momentum 


+ These arguments, like the possible values shown for A, apply to particles with non-zero mass. For 
massless particles there is no rest frame, and the helicity can take only the two values A = +s. This is 
because of the fact already mentioned in §8, that the states of such a particle are classified by their 
behaviour with respect to the axial-symmetry group, which allows only twofold degeneracy of levels (as 
regards the properties of the wave equation, this means that in the limit as m > 0 the set of equations for 
a particle with spin s separates into independent equations corresponding to massless particles with 
spins s,s—1,...). For example, the photon has \ =+1, and the corresponding w™ are the three- 
dimensional vectors e~” (8.2). 

< The delta function 5 is defined so that 


| 6 — n)doy = 1. 


The delta function which imposes a fixed value of the energy is omitted in (16.2), and similarly in (16.4) 
below. 
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representation. This may be done by direct analogy with the formulae derived in 
QM, §103 for the wave functions of a symmetrical top. They were obtained there 
on the basis of the formulae for the transformation of wave functions under finite 
rotations (QM, §58). These in turn were based solely on the symmetry properties 
with respect to rotation, and are therefore applicable to functions in the momentum 
representation just as much as to coordinate functions. 

In addition to the coordinates x, y, z fixed in space (with respect to which the 
functions Wm, are written), we shall also use “moving” coordinates €, n, ¢, with the 
¢-axis in the direction of v. Without repeating the argument (cf. the derivation of 
QM, (103.8)), we can write 


Pima (Kk) = Yin Dimn(v), 


where w)) is the wave function in the moving coordinates, describing the state of a 
particle with a definite value of the ¢-component of the angular momentum, j, = A; 
in the momentum representation, of course, this function is the same as the 
amplitude u°’. The normalized wave function (see below) is 


Vimx (K) = J “Lit Diim(v)u(k). (16.4) 


Here, however, there is a question of the choice of phases, because of the 
following non-uniqueness: a rotation of the coordinates €, n, ¢ relative to x, y, z is 
defined by three Eulerian angles a, B, y, whereas the direction of v, on which the 
particle wave function can alone depend, is defined by the two spherical angles a = # 
and B = 0. Itis thus necessary to agree on some definite choice of the angle y. We shall 
take y =0, defining D®,(v) as. 


Din(v) = Din(p, 0,0) = e'"* dyi(0). (16.5) 


From QM, (58.21), the functions (16.5) are seen to satisfy the orthonormality 
conditions: 


do, 1 
| D&,*@) DB) Ae = +1 Oinip Om my 


(16.6) 


where do, = sin 6 dé dd. The orthogonality of the functions Wim, with respect to the 
suffix A is ensured by the factor u“’. Thus the functions Wn, are orthogonal in all 
three suffixes, as they should be, and with the coefficient chosen in (16.4) they are 
normalized by the condition® 


| | ima |” doy = 1. (16.7) 


Here we assume that the amplitudes u” are normalized to unity: u”u®* = 1. 

Let us now consider the behaviour of the wave functions of helicity states 
under inversion of the coordinates. The product of the polar vector v and the axial 
vector j is a pseudoscalar. It is therefore obvious that inversion will change a state 
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with helicity A into one with helicity —A; all that is necessary is to determine the 
phase factors in these transformations. 

Under inversion, vy —v. The vector v is defined by the two angles ¢ and 0, and 
the transformation v—-—v is brought about by the changes 6-gd+7, 0-7 — 0. 
This determines the f-axis but leaves indefinite the position of the € and ny axes, 
which depends also on the third Eulerian angle y; the transformation of 6 and @ 
alone does not distinguish, in this sense, between reflection of the coordinates and 
rotation of the ¢-axis. Expressed in terms of all three Eulerian angles, inversion 1s 
the transformation 


a=od-ot+7, B=O0-7-0, yror-y. (16.8) 


Hence, if D{?,(v) is defined as in (16.5) (i.e. with y = 0), and the transformation 
v—-—v is regarded as being the result of inversion, then 


DY?.(—v) = Did + a, 7 — 8, 77). (16.9) 
From formulae QM (58.9), (58.16) and (58.18) we hence find 


Din(—v) = e7 doi(m — 8) emer 
=(-1* ed, (8) 
= (-1"DS\ m(d, 8, 0), 
Or 
Din(—v) = (“1 DY, nv), ve) 


where j — A Is an integer. 
A similar formula for the spinor w®) can be obtained by noticing that its 
components w™ are the same, apart from a factor, as the functions 


wp) ~ DOW)*. (16.11) 


For, by applying the transformation formulae QM (58.7) to the spin eigenfunctions 
and taking the ¢-component of the spin to have a definite value A (i.e. replacing Wm 
by 5m, on the right-hand side of QM (58.7), we find that D$?(v) are the spin wave 
functions corresponding to definite values of the z and £ components (o and A) of 
the spin. The set of these functions with o = —s,...,+s forms, according to the 
correspondence formulae (QM (57.6)), a covariant spinor of rank 2s. The com- 
ponents of the contravariant spinor, which according to the formulae QM (57.2) 
correspond to the components w®, are transformed as the complex conjugates of 
the components of the covariant spinor of the same rank. 
From (16.10) and (16.11), we have 


w(—p) = (-1) wp), (16.12) 


where s—. is an integer. The inversion operation applied to w™), however, not 
only changes v into —v but also multiplies w“’ by a common phase factor (the 
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“internal parity” of the particle), which we shall denote by 7: 
Pw(v) = nw(-v) = n(-1) W700). (16.13) 
For the relativistic amplitude uk), this transformation becomes 


Pu(k) = nBu(-k) 
= 9(- 1) M"(k), (16.14) 


where B is a certain matrix which is a unit matrix with respect to the components 
of u“’ which remain in the limit |p| 0. It is important to note that this matrix does 
not depend on the quantum numbers of the state, and in this sense the difference 
between (16.13) and (16.14) is unimportant.t 

On applying (16.14) to (16.2), we obtain the law of transformation of the wave 
functions of the states |nA): 


Par (v) = n(- 1) Wn, -a (0). (16.15) 


For spherical helicity states, using (16.10) and (16.12), we obtain the transformation 
law 


Pim (v) = H(- 1) Wim, -a(v). (16.16) 


The states Wino are transformed into themselves, according to (16.16), i.e. they 
have a definite parity. If 14+ 0, however, only superpositions of states with opposite 
helicities have a definite parity: 


Wimla| = os (Pima = Wim, -a)- (16.17) 


On inversion, these are transformed into themselves: 
Pusmtal(v) = = n(- Il Wha). (16.18) 


It should be noted that in this section we have arrived at a classification of 
states of a free particle with a given angular momentum, using only conserved 
guantities and without invoking the concept of the orbital angular momentum 
(which was employed, for instance, in §§6 and 7 for classifying photon states). 

As an example, let us consider the case of spin one. In the rest frame the 
amplitudes u” (4-vectors) become the three-dimensional vectors e”’, which here 
take the place of the amplitudes w™’. The action of the operator of spin one on the 
vector function e is given by the formula 


(Sie), = — lejxies 5 (16.19) 


+ For example, when s =1 the amplitudes u™ are the 4-vectors (16.22); B is then entirely a unit 
matrix with respect to the 4-vector indices, Bu, = 6... When s = 3, as we shall see in Chapter III, u” is a 
bispinor, the phase factor y =i, and B is the Dirac matrix y’ (see (21.10)). 
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see QM, §57, Problem 2. Thus equation (16.1) becomes 
in x e = re, (16.20) 


The solutions of this equation (in €nf coordinates with the ¢-axis in the direction of 
n) are the same as the spherical unit vectors (7.14): 


e =i(0,0,1), e*= Fy (l, +i, 0). (16.21) 


In a frame of reference in which the particle has momentum p, the helicity state 
amplitudes are the 4-vectors 


yO = (el =e yt De = (0 e(@)) (16 22) 
m’ m b ° . 
If e is a polar vector, then »=—1, and the functions (16.17), which are 


three-dimensional vectors when s = 1, have the following parities: 


Wimlal: P = (—-1)), 
Wimlaj P= (—1)"1, 
Wimo: P = (—1)’. 


On comparing with the definition of the spherical harmonic vectors (7.4), we see 
that these functions are identical (apart from phase factors) with Y), Yi”, Y‘? 
respectively. After ascertaining the phase factors (by comparing values for 6 = 0, 


say), we obtain the equations 
yig= i A Dg +e D0) 
ye =i A*T wpe, +e D4,,), (16.23) 
yg=i V7} og, 


where j is an integer; e’? =n xe"? are spherical unit vectors along axes é', n’, ¢ 


which are obtained from é, 7, ¢ by a rotation of 90° about the f-axis. 
The last formula (16.23) is equivalent to the expression QM. (58.23) for d§),(6). 
The first or second formula (16.23) leads to a simple expression for the functions 


+ The choice of phase factors is determined by the condition that the spin operator matrix elements 
calculated with the eigenfunctions (16.21) must be in accordance with the general definitions in QM, 
8§27 and 107. 
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d!, ,. We have 


ete cee eer % 
ha ae 


1 (+1) 
eae VYin. 
VuiG+DI° i 


The scalar product on the right can be written explicitly in the coordinates €, », ¢, 
with 
(= 0 ) (= 1 0 ) 
—,—-)-(—, =—- = |. 
0& an 06’ sin 6 dd 
With the definitions (7.2) of Yjm and (16.5), the result is 


(i) _ ¢_4y\mt1 (j-—m)! ( oO m m ss 
di?) m(@) = (-1) Vas + 2a +and P;'(cos 0), m=O. 


(16.24) 
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§ 17. Four-dimensional spinors 


IN THE non-relativistic theory, a particle with arbitrary spin s is described by a 
quantity with 2s + 1 components, a symmetrical spinor of rank 2s. These quantities 
are, mathematically, realizations of the irreducible representations of the spatial 
rotation group. 

In the relativistic theory, this group is only a subgroup of the wider group of 
four-dimensional rotations, the Lorentz group. It is therefore necessary to develop 
the theory of four-dimensional spinors (4-spinors), as quantities which are realiza- 
tions of the irreducible representations of the Lorentz group. This theory will be 
given in §§17-19. In §§17 and 18 we shall consider only the proper Lorentz group, 
which excludes spatial inversion; the latter will be dealt with in §19. 

The theory of 4-spinors is analogous in structure of that of three-dimensional 
spinors (B. L. van der Waerden, 1929; G. E. Uhlenbeck and O. Laporte, 1931). 

A spinor é* is a quantity having two components (a = 1,2); as components of 
the wave function of a particle with spin 3, é' and é correspond to the respective 
eigenvalues +3 and —3 of the z-component of the spin. Under any transformation 
belonging to the (proper) Lorentz group, the two quantities é' and &* are trans- 
formed into linear combinations of themselves: 


oo 1 2 
more (17.1) 


The coefficients a, B, y, 6 are definite functions of the angles of rotation of the 
4-coordinate system, and must satisfy the condition 


ad — By =1; (17.2) 


that is, the determinant of the binary transformation (17.1) is equal to unity, as are 
the determinants of the coordinate transformations in the Lorentz group. 

Because of the condition (17.2), the bilinear form &'2’- €’=' (where &* and 
= are two spinors) is invariant under the transformation (17.1), and corresponds to 
a particle with spin zero which “consists” of two particles with spin 3. In order to 
write such invariant expressions in a natural way, the “‘covariant” components é, 
are used as well as the “‘contravariant’? components &* of the spinor. Their 


relationship is governed by the “metric spinor”’ g,,:T 


ba = Bapé", (17.3) 


+ The spinor indices will be denoted by the letters at the beginning of the Greek alphabet: a, B, 
ig 
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where 
0 1 
Bap = é ai) (17.4) 
so that 
&,= &, &=—€'. (17.5) 
Then the invariant é'=’-— &&' becomes the scalar product €°=,, and &°3, = 


The properties so far stated are formally the same as those of three-dimensional 
spinors. A difference arises, however, when complex-conjugate spinors are con- 
sidered. 

In the non-relativistic theory, the sum 


wis + wey, (17.6) 


which determines the probability density for the localization of the particles in 
space, must be a scalar, and the components w** must therefore be transformed as 
the covariant components of a spinor; the transformation (17.1) must therefore be 
unitary (a = 6*, B = — y*). In the relativistic theory, however, the particle density 
is not a scalar, but is the time component of a 4-vector. The above-mentioned 
condition therefore no longer applies, and the transformation coefficients need 
satisfy no condition other than (17.2). The four complex quantities a, B, y, 6 under 
the condition (17.2) alone are equivalent to 8 — 2 = 6 real parameters, in accordance 
with the number of angles which define a rotation of the 4-coordinate system 
(rotations in six coordinate planes). 

Thus complex-conjugate binary transformations are quite different, and in the 
relativistic theory there exist two types of spiors. A special notation is customary, 
in order to distinguish these two types: the indices of spinors which are trans- 
formed by the complex conjugate formulae to (17.1) are written with dots over 
them and are called dotted indices. Thus, by definition, 


ae Se (17.7) 


where the sign ~ denotes “is transformed as’’. The transformation formulae for a 
“dotted” spinor are therefore 


qf ota Bg, a Syn Pot. (17.8) 


The operations of raising and lowering the dotted indices are carried out in the 
same way as for the undotted indices: 


mi=W,  m=—n'. (17.9) 


The behaviour of 4-spinors as regards spatial rotation is the same as that of 
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3-spinors, for which, as we know, w* ~ w*. According to the definition (17.7), the 
4-spinor n, therefore behaves under rotations in the same way as the contravariant 
3-spmor w*. The covariant components nj; and 7; therefore correspond, as the 
components of the wave function of a particle with spin 3, to the eigenvalues 3 and 
—; of the spin component. 

Spinors of higher rank are defined as sets of quantities which are transformed 
as products of the components of a number of spinors of rank one. The indices of 
these spinors of higher rank may be partly dotted and partly undotted. For 
example, there exist three types of spinors of rank two: 


EF sees Ee? pee 25, EX B n isu n*He. 


In this respect, the statement of just the total rank of a spinor does not uniquely 
define it; we shall therefore, where necessary, indicate the rank as a pair of 
numbers (k,1), the numbers of undotted and dotted indices respectively. 

Since the transformations (17.1) and (17.8) are algebraically independent, it is 
not necessary to specify the sequence of dotted and undotted indices; in this sense 
the spinors £°° and £°*, for example, are the same. 

In order to be invariant, every spinor equation must have on each side the same 
numbers of undotted and dotted indices, since otherwise the equation could not 
remain valid when the frame of reference was changed. Here we must remember 
that taking the complex conjugate implies interchanging dotted and undotted 
indices. The relationship n°* = (é€%°)* between two spinors is therefore invariant. 

Spinors or their products can be contracted only. with respect to pairs of indices 
of the same kind (dotted or undotted); summation with respect to two indices of 
different kinds is not an invariant operation. Hence, from the spinor 


ala as ee (17.10) 


which is symmetrical in all k undotted indices and in all | dotted indices, we can 
obtain no spinor of lower rank (since contraction with respect to a pair of indices in 
which the spinor is symmetrical gives zero). Thus we cannot construct from the 
quantities (17.10) a smaller number of linear combinations of them which in turn 
are transformed into linear combinations of themselves by every transformation in 
the group. That is, the symmetrical 4-spinors are realizations of the irreducible 
representations of the proper Lorentz group. Each irreducible representation is 
specified by the pair of numbers (k, 1). 

Each spinor index takes two values, and there are therefore k +1 essentially 
different sets of numbers a;, a2,..., a, in (17.10) (containing 0, 1, 2,..., k ones and 
k,k—-1,...,0 twos) and 1+ 1 sets of numbers Bi, Bo, hs Bi. The symmetrical spinor 
of rank (k, 1) thus has a total of (k + 1)(1+ 1) independent components, and this is 
also the dimension of the corresponding irreducible representation. 


$18. The relation between spinors and 4-vectors 


The spinor ¢ 8 with one dotted and undotted index, has 2 x 2 = 4 independent 
components, the same as the number of a 4-vector. It is therefore clear that both 
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are realizations of the same irreducible representation of the proper Lorentz group. 
and that there must consequently be a certain relation between their components. 
In order to ascertain this relation, let us first consider the corresponding relation 
in the three-dimensional case, using the fact that 3-spinors and 4-spinors must 
behave in the same manner with respect to purely spatial rotations. 
For the three-dimensional spinor w°’, the correspondence formulae are as 
shown in QM, 857; they will here be written as 


dy = 20? — yp) = 20? + pr), 
ay = —31(b? + wh") = Qh! w?)). 
a, = 2? +") = xh — br), 


where d,, dy, a; are the components of a three-dimensional vector a. For the 
four-dimensional case, the components w*, must be replaced by ¢°°, and ay, ay, az 
must be taken to be the contravariant components a', a*, a’ of a 4-vector. The 
form of the expression for the fourth component a” is evident from the fact, noted 
in §17, that the quantity (17.6) must transform as a°. Hence a’~ £''+¢”, the 
coefficient of proportionality being determined so that the scalar ¢,,¢°" is the same 
as the scalar 2a,a" =2a’. 
Thus we obtain the correspondence formulae 


a=MEP4 0), a= sie? ae (18.1) 
a3 =! = £7), a° =(¢"! a 7 =), 
The inverse formulae are 
Sea pa, CS Sa a, 
eo 1 2 | ae see F . (18.2) 
C°=-Ly=a' —ia, ¢ (i=a tla, 
with 
ipa aa. (18.3) 
Moreover 
nor Sela (18.4) 


as is seen from the fact that the spinor ,,s¢,°, of rank two, is antisymmetric in the 
indices a, y, and is therefore proportional to the metric spinor. 

The correspondence between the spinor ¢°° and the 4-vector is a particular case 
of a general rule: any symmetrical spinor of rank (k,k) is equivalent to a sym- 
metrical 4-tensor of rank k which is irreducible (i.e. which gives zero on contrac- 
tion with respect to any pair of indices). 

The relation between the spinor and the 4-vector may be written in a compact 
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form by means of the two-rowed Pauli matricest 


Ox = & -) Oy = ( ‘af. Oo, = ( e (18.5) 


If the matrix of the quantities ¢ °8 (with the indices raised and the first undotted) is 
symbolized by ¢, then formulae (18.2) become 


C=a-ata’, (18.6) 


the second term denoting of course the product of a° and a unit matrix. The inverse 
formulae are 


a= tr (fo), a =e (18.7) 


Using formulae (18.6), (18.7), we can determine the relation between the laws of 
transformation of the 4-vector and the spinor, and thus express the law of 
transformation of the spinor in terms of the parameters of rotations of the 
4-coordinates. 

We write the transformation of the spinor &€* in the form 


é*" = (BE)*, B=(° ) (18.8) 


where B is a two-rowed matrix formed from the coefficients of the binary 
transformation. Then the transformation of the dotted spinor is 


1 ®' = (B*n)* = (B*)*, (18.9) 


and the transformation of the spinor £%8 ~ Evy 8, of rank two, may be symbolized 
ast ¢'’= B¢B". For the infinitesimal transformation B =1+A, where A is a small 
matrix, we have as far as first-order quantities 


C= £4+OC+ 4A"). (18.10) 


Let us first consider the Lorentz transformation to a frame of reference moving 
with an infinitesimal velocity 5V (without change in direction of the space coor- 
dinate axes). Then the 4-vector a“ =(a°,a) is transformed as follows: 


a’ =a-—a’dV, a” = a°—a-8V. (18.11) 


+ To simplify the notation, matrix operators acting on spin variables are written without circumflexes. 
+t For the covariant components we have 


&, = (B'é). = (Ba, 
Na = (MB*')a, (18.8a) 


so that the product €&* of two spinors remains invariant. 
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We now make use of formulae (18.7). The transformation of a°® may be represen- 
ted, firstly, as 


Al Sey a°—4tr(fo - dV); 
secondly, as 


a” =5trZ’=a°t+3tr(AC+QA*) 
=a°+3trfC(A +2"). 
These two expressions must be identically equal (i.e. equal for all values of ¢). 
Hence 


A+A™=—o- BV. 
Treating the transformation of a in the same way, we find 
GA +A o= — BV. 
These equations, as equations for A, have the solution 
AN=At= —30'-8V. 
Thus an infinitesimal Lorentz transformation of the spinor &“ has the matrix 
B=1-i0 -ndV, (18.12) 


Where n is a unit vector in the direction of the velocity 6V. From this we 
can easily find the transformation for a finite velocity V. To do so, we recall 
that a Lorentz transformation signifies (geometrically) a rotation of the 4-coor- 
dinates in the plane of t and n through an angle ¢ which is related to the velocity V 
by? tanh ¢ = V. An angle 5 = &V corresponds to an infinitesimal transformation, 
and a rotation through a finite angle @ is carried out by a ¢/5¢-fold repetition of a 
rotation through 6. Raising the operator (18.12) to the power @/5¢@ and taking the 
limit 6¢ > 0, we obtain 


1 

Bee’. (18.13) 
The mathematical significance of this operator is seen by noticing that, from the 
properties of the Pauli matrices, all even powers of n+ o@ are equal to 1, ana all odd 


powers are equal to n+ o. Since the expansions of the hyperbolic sine and cosine 
contain respectively odd and even powers of the argument, we have finally 


B =coshi¢ —n-@ sinhid, 


(18.14) 
tanh @ = V. 


+ The metric is pseudo-Euclidean in planes containing the time axis. 
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The matrices B of the Lorentz transformations are Hermitian: B = B’* 
Let us now consider an infinitesimal rotation of the space coordinates. The 
three-dimensional vector a is transformed as follows: 


a’=a-— 60Xa, (18.15) 


where 680 is the vector of the infinitesimal angle of rotation. The corresponding 
transformation of a spinor may be found similarly. There is no need to do so, 
however, since the behaviour of 4-spinors under spatial rotations is the same as 
that of 3-spinors, and the transformation of the latter is known from the general 
relationship between the spin operator and the operator of an infinitesimal rotation: 


B=1+3io- 50. (18.16) 


The change to a rotation through a finite angle 6 is made in the same way as that 
from (18.12) to (18.14): 


B = exp (sin o) = cos36 + in: o sin 56, (18.17) 


where n is a unit vector along the axis of rotation. This matrix is unitary (B* = B™'), 
as it should be fot a spatial rotation. 


$19. Inversion of spinors 


The discussion (in QM) of the three-dimensional theory of spinors did not 
consider their behaviour under the operation of spatial inversion, since in the non- 
relativistic theory this would not have led to any new physical results. Here we 
shall examine the point, however, in order to make clearer the subsequent analysis 
of the inversion properties of 4-spinors. 

The operation of inversion does not alter the sign of the spin vector, or of any 
axial vector, and the spin component s, is therefore also unchanged in value. 
Hence it follows that inversion can change each component of the spinor w* only 
into a multiple of itself: 


pS > Pw", (19.1) 


where P is a constant factor. On repeating the inversion, we return to the original 
coordinates. For a spinor, however, a return to the original position can be 
regarded in two different ways, as a rotation through 0° or 360°. These two 
definitions are not equivalent with respect to spinors, since w* changes sign on 
rotation through 360°. Thus two alternative views of inversion are possible: one 
where 


Poe. P=+1, (19.2) 
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and one where 
| PS ae, (19.3) 


Here it is important to note that the concept of inversion must be defined in the 
same way for all spinors. It is not permissible for different spinors to behave 
differently under inversion (i.e. in accordance with both (19.2) and (19.3)), since in 
that case it would not be possible to construct a scalar (or a pseudoscalar) from 
every pair of spinors: if the spinor &%“ were transformed according to (19.2), and ¢* 
according to (19.3), then the quantity /“d, would be multiplied by +i under 
inversion, instead of remaining constant (or simply changing sign). 

It should be emphasized that (whatever the definition of inversion) the assign- 
ment of a particular parity P to a spinor has no absolute significance, since spinors 
change sign on rotation through 27, and this can always be carried out 
simultaneously with inversion. The “‘relative parity’ of two spinors, defined as the 
parity of the scalar /“d, formed from them, has absolute significance, however; on 
rotation through 27, both spinors change sign, and the indeterminacy therefore 
does not influence the parity of this scalar. 

Let us now go on to discuss four-dimensional spinors, first noting that inversion 
changes the sign of only three coordinates x, y, z out of four x, y, Z, t; it 
therefore commutes with spatial rotations but not with transformations which 
rotate the t-axis. If L is the Lorentz transformation to a frame of reference moving 
with velocity V, then PL = L’P, where L’ is the transformation to a frame moving 
with velocity —V. 

Hence it follows that the components of the 4-spinor €* cannot be transformed 
into multiples of themselves under inversion. If the inversion of the spinor €* were 
given by the transformation (19.1) as before (i.e. if it were represented by a matrix 
proportional to the unit matrix), it would commute with every Lorentz trans- 
formation, and this certainly cannot be true, since the operations L and L’ are not 
the same when applied to &°. 

Thus inversion must transform the components of the spinor &* into expres- 
sions involving other quantities. The latter can only be the components of some 
other spinor n“ whose transformation properties are not the same as those of &*. 
Since inversion does not affect the z-component of the spin (as mentioned above), 
the components é' and &* can only become nj and 7 on inversion, these cor- 
responding to the same values s, —4 and s, = —3}. If inversion is taken to be an 
operation which gives identity when carried out twice, its effect may be expressed 
by the formulae 


co Pe; heres: (19.4) 


For the covariant components €, and contravariant components 7°, these trans- 
formations change sign: 


by — 7%, n> ames (19.4a) 


since the lowering and raising of the same index lead to opposite signs (cf. (17.5) 


70 Fermions 819 


and (17.9)).+ If, however, inversion is taken in the sense such that P’?= —1, its 
effect is given by 


eo as Na > 1€" (19.5) 
or, equivalently, 


E> —in®, nt > —i&. (19.5a) 


There is a certain difference between the two definitions of inversion in that 
with the second definition complex-conjugate spinors are transformed in the same 
manner: if 2, = n*, H% = €**, then by (19.5) =, > -—iH*, H* > —i3,, i.e. the rule is the 
same as for &,, n*. According to the definition (19.4), however, we should obtain 
=,-H*, H*—2,, which is opposite in sign to the transformation of the spinors &,, 
7*. We shall return in §27 to some possible physical consequences of this 
difference. 

In the following, the definition (19.5) will be used. 

The spinors €* and 7, are, as we know, transformed in the same way by the 
rotation subgroup. On taking the combinations 


Laine, (19.6) 


we obtain quantities which are transformed under inversion according to (19.1) 
with P = +i. These combinations, however, do not behave as spinors under all the 
transformations of the Lorentz group. 

Thus the inclusion of inversion in the symmetry group makes necessary the 
simultaneous treatment of a pair of spinors (€*, nz); this is called a bispinor (of 
rank one). The four components of a bispinor form a realization of one of the 
irreducible representations of the extended Lorentz group. 

The scalar product of two bispinors (€°, n.) and (=°, H,) can be formed in two 
ways. The quantity 


Cae yg: (19.7) 
is unchanged by inversion, i.e. it is a true scalar. The quantity 
€°E  — naH* (19.8) 


is also invariant under rotations of the 4-coordinates, but changes sign under 
inversion, i.e. it is a pseudoscalar. 

A spinor of rank two, °°, may also be defined in two ways. If it is defined by 
the transformation rule 


cab — go 4 Bey (19.9) 


+ The definition (19.4) is, of course, to some extent arbitrary, since the quantities €“ and 7. are 
independent. For instance, if na is replaced by a new spinor 4 = ena, (19.4) is replaced by the equivalent 
definition 


-i 


f f . 
E*>e° y., Naveré. 
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we obtain quantities which are transformed under inversion as follows: 
POP alee: (19.10) 


The 4-vector a” to which such a spinor is equivalent is transformed, according to 
(18.1), by (a°,a)—(a°,—a), ie. it is a true 4-vector, and the three-dimensional 
vector ais a polar vector. | 

It is also possible, however, to define ¢°° thus: 


ce ~ tH! — Een? (19.11) 
Thent 
C!s — Lig, (19.12) 


Such a spinor corresponds to a 4-vector such that under inversion (a°, a) (—a’, a), 
i.e. a 4-pseudovector (the three-dimensional vector a being an axial vector). 
Symmetrical spinors of rank two, with indices of the same type, are defined by 


Cetera ee <a yll a  yglle: (19.13) 
On inversion they are transformed into each other: 
fe aaa: (19.14) 


The pair (€°°, nag) forms a bispinor of rank two. It has 3+3=6 independent 
components. The antisymmetric 4-tensor of rank two a”” also has this number of 
independent components. There must therefore be a certain correspondence be- 
tween the bispinor and the tensor; both are realizations of equivalent irreducible 
representations of the extended Lorentz group. 

Since the spinors €** and ng are transformed independently by the proper 
Lorentz group, we can construct from the components of the 4-tensor a’” two 
groups of quantities which are transformed only into combinations of one another 
under any rotation of the 4-coordinates. This division is achieved as follows. 

We define a three-dimensional polar vector p and a three-dimensional axial 
vector a related to the components of the 4-tensor a”” by 


0 pr Py Dz 
ate = = (p, a), (19.15) 


where (p,a) is a concise notation which we shall use in order to specify the 


+ It must be emphasized that the transformation rules (19.10) and (19.12), which differ in the sign on 
the right, are not equivalent, since components of the same spinor appear on both sides (cf. the last 
footnote). 
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components of such a tensor. Then a,, = (—p, a), and, of the two quantities 
1 =a 
a—p = 74a,,a"", a-p=seup aa”, 


the first is a scalar and the second a pseudoscalar; both are invariant under the 
proper Lorentz group. The squares of the three-dimensional vectors f* = p+ ia are 
therefore also invariant. Thus any rotation in 4-space is equivalent, as regards the 
vectors f, to a “rotation” in 3-space, through angles which are in general complex; 
the six angles of rotation in 4-space correspond to three complex “angles of 
rotation” of the three-dimensional coordinates. The operation of spatial inversion 
changes the sign of p but not that of a, and converts the vectors f° and —f into 
each other. The components of these vectors are the required two groups of 
quantities formed from the components of the tensor a”’. 

This also makes evident the correspondence between the components of the 
4-tensor a“” and the spinors €°°, nag. Since the Lorentz group contains as a 
subgroup the spatial rotations, the relations between the components of the spinor 
and those of the three-dimensional vector must be the same as for three-dimensional 
spinors: 


fr=e?-e"), fe =hEP +e), ff =e": 
(19.16) 


fr=2.ne8-nid, fy =3imst nid, f= ni. 


PROBLEM 


Derive the general correspondence between spinors of even rank and 4-tensors. 


SOLUTION. All spinors for which k+I is even are realizations of single-valued irreducible 
representations of the extended Lorentz group, and are therefore equivalent to the 4-tensors which are 
realizations of similar representations.t 

A spinor of rank (k, k) can be defined so that it is transformed under inversion by 


an aa et ee ee (1) 


Such a spinor is equivalent to a symmetrical irreducible 4-tensor of rank k, which is a true tensor or a 
pseudotensor according to the sign in (1). 
Spinors of ranks (k, 1) and (1, k), forming a bispinor, are transformed under inversion by 


k 
ee 


7 1 _ 
ies) ally Ce ee (2) 
k l 


When | = k + 2, the bispinor is equivalent to an irreducible 4-tensor Qjuvjoc... of rank k + 2, antisymmetric 
in the indices [xv] and symmetric in all the other indices. The irreducibility of this tensor signifies that it 
gives zero on contraction with respect to any pair of indices and on dualization with respect to any 
three indices (i.e. e*””’ dtyvjpo.. = 0); the latter condition implies that the result is zero on taking the cyclic 
sum over three indices, wv and any one other. 

When | = k + 4, the bispinor is equivalent to an irreducible 4-tensor a@payjppjor... of rank k + 4, having the 
following properties: it is antisymmetric in the pairs of indices [Ay] and [vp], symmetric in all others, 
symmetric for interchange of [Aw] with [vp], and gives zero on contraction with respect to any pair of 
indices and on dualization with respect to any three indices. 

Generally, when |=k+2n, the bispinor is equivalent to an irreducible 4-tensor of rank k + 2n, 
antisymmetric in n pairs of indices and symmetric in the other k indices. 4-tensors antisymmetric in 


+ Spinors of odd rank are realizations of two-valued representations of the group: a spatial rotation 
through 360° changes the sign of spinors, so that two matrices of opposite sign correspond to each 
element of the group. 
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larger numbers (threes, fours, etc.) of indices do not appear in this classification, for the obvious reason 
that an antisymmetric tensor of rank 3 is equivalent (dual) to a pseudovector, and an antisymmetric 
tensor of rank 4 reduces to a scalar (is proportional to the unit pseudotensor e*“”’); antisymmetry in a 
still greater number of indices is not possible in 4-space. 


§20. Dirac’s equation in the spinor representation 


A particle with spin 3 is described, in its rest frame, by a two-component wave 
function, i.e. a three-dimensional spinor. The “four-dimensional origin” of this may 
be either an undotted or a dotted 4-spinor. Both these 4-spinors appear in the 
description of the particle in an arbitrary frame of reference; we shall denote them 
by €* and 74.T 

For a free particle, the only operator which can appear in the wave equation is 
(as shown in 810) the 4-momentum operator p, = id,. In the spinor notation, this 
4-vector corresponds to the operator spinor p,g, with 


p'! = pu = p, + Bo, p” = pri = Po— Ba | 
A 4 ral Fal cA mw | A A A 20.1 
p= — poi =px—ipy, pp” = — bo = py + ipy.. eu) 


The wave equation is a linear differential relation between the components of 
spinors, expressed by the operator p,g. The requirement of relativistic invariance 
leads to the equations 


png = an 
a (20.2) 
Dias" = MQ, 

where m is a dimensional constant. There would be no meaning in using different 
constants m; and m, here, or in changing the sign of m, since the equations could 
still be reduced to the above form by an appropriate transformation of &€°* or 7,4. 

By substituting »g from the second equation (20.2) in the first, we can eliminate 
one of the two spinors: 


pp pe? = me’. 


1 
AaB, . — + 
| aT: a 


From (18.4), pp, = p*5%, and thus we obtain 
(p*— m*)é” = 0, (20.3) 


whence it 1s evident that m is the mass of the particle. 

It should be noticed that the need to use the mass in the wave equation implies 
the simultaneous consideration of two spinors (€* and 7,): with only one of these, 
it would not be possible to construct a relativistically invariant equation containing 


+ A three-dimensional spinor of rank one may also “originate” from 4-spinors of higher odd rank 
which, in the rest frame, become antisymmetric in one or more pairs of indices. These would, however, 
lead to higher-order equations (cf. the third footnote to §10). 
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a dimensional parameter. The wave equation is necessarily invariant under spatial 
inversion if the transformation of the wave function is defined by 


Poh Pas Ne Ple". (20.4) 


It is easily seen that the two equations (20.2) are interchanged by this substitution 
(together with p°*—>p,g, which is evident from (20.1)). Two spinors which are 
interchanged by inversion form a four-component quantity, a bispinor. 

The relativistic wave equation given by (20.2) is called Dirac’s equation, having 
been first derived by Dirac in 1928. In order to analyse und apply this equation 
further, let us consider various ways in which it may be written. 

Using (18.6), we can rewrite equations (20.2) as 


(po+p:a)n = “| 
(20.5) 


(po— p+ a)é = m7. 


Here the symbols é and 7 denote two-component quantities, the spinors 


(a=) one 


(the first with upper and the second with lower indices). Here and below, 
multiplication of the matrices 0 by any two-component quantity f means multi- 
plication by the usual matrix rule: 


(of a = Tupf p. (20.7) 


The vertical column notation for f is in accordance with the multiplication of each 
row of o by the column f. 
For subsequent reference, the Pauli matrices may be written once more; 


n= (5 a 0 =(5 rr) n= (6 =) (20.8) 


Their fundamental properties are 


OiO~n + O,0;1 = 285i, 
(20.9) 


Oi = lei, + ik; 


see QM, §55. 
We shall also give the wave equation satisfied by the complex-conjugate wave 
function formed from the spinors 


é*=(E*, 2%),  n* =(n*, 08). (20.10) 


Since all the operators p, contain the factor i, p% = — p,. In taking the complex 
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conjugate of both sides of equation (20.5), we must also use the fact that, since the 
matrices o are Hermitian (o* = G), 


(of )% = Oxpl p= {Boge _ (f*o)a3 


the resulting equations are 


n*(po+ p: a) =—mEé ; eon 


E*(Bo— p+ 0) = —mn*. 


Here it is conventionally implied that the operators p” act on the function to the 
left of them. The writing of €* and 7* as horizontal rows is in accordance with the 
matrix multiplication in these equations: the row f is multiplied by the columns of 
the matrix a, 


(f*o)a = fi Opa- (20.12) 


The inversion transformation for €*, 7* is defined as the complex conjugate of 
the transformation (20.4): 


P: é*>-in;, 9,2 -ié**. (20.13) 


§21. The symmetrical form of Dirac’s equation 


The spinor form of Dirac’s equation is the most natural one, in the sense that its 
relativistic invariance is immediately apparent. In applications of the equation, 
however, other forms of the wave equation may be more convenient, which are 
obtained by a different choice of the four independent components of the wave 
function. 

We shall denote the four-component wave function by the symbol ys, with 
components yy; (i = 1, 2, 3, 4). In the spinor representation, it is a bispinor: 


Gz =) (21.1) 


But the independent components of w can equally well be taken as any linearly 
independent combinations of components of the spinors € and 7.+ We shall 
arbitrarily limit the acceptable linear transformations by the one condition of 
unitarity; such transformations leave unchanged the bilinear forms constructed 
from w and wW* (§28). 

In the general case of an arbitrary choice of the components of , Dirac’s 
equation can be put in the form 


Puy We = mui, 


+ For brevity, the four-component quantity yw will be referred to as a bispinor even in non-spinor 
representations. 
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where y" (yw =0, 1, 2, 3) are certain four-rowed matrices (Dirac matrices). We shall 
usually write this equation in a symbolic form, omitting the matrix indices: 


(yp —m)b =0, (21.2) 
where 
YP = y"B, = Boy —P°y¥ 
d 
= 1 ae 1 e 
iy a +iy:V, 
y=(y,77,7)- 


For example, the spinor form of the equation with the components of w as in 
(21.1) corresponds to the matricest 


vs) 0%) a3 


as is easily seen by writing the equations (20.5) as 


Pate rere (Eyam (*) 


and comparing with (21.2). 
In the general case, the matrices y need satisfy only conditions ensuring that 
p’ =m’. To find these conditions, we multiply equation (21.2) on the left by yp: 


(yy Bb = m(pry")b = m*y. 


Since p,p, is a Symmetrical tensor (all the operators p, commute), this equation 
may be rewritten 


Dub lyty’ + y’y" yb = m7, 
and we must therefore have 
Ye Ie ee (21.4) 
Thus all the pairs of different matrices y" anticommute, and their squares are 
Gy =6) =O) ==, Grek (21.5) 


Under an arbitrary unitary transformation of the components ws: ww’ = Uw, where 


t+ Here and below, we use a compact two-rowed notation for four-rowed matrices. Each symbol in 
(21.3) represents a two-rowed matrix. 
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U is a unitary four-rowed matrix, the matrices y are transformed as follows: 
y' = UyU "= UYU", (21.6) 


so that the equation (yp —-m)w=0 becomes (y’p — m)W'=0. The commutation 
relations (21.4) remain unchanged, of course. 

The matrix y° (21.3) is Hermitian, and the matrices y are anti-Hermitian. These 
properties are preserved under any unitary transformation (21.6), and we therefore 
always havet 


yo=—y, inte te (21.7) 


The equation for the complex-conjugate function w#* may also be given. Taking 
the complex conjugate of equation (21.2) and using the properties (21.7), we obtain 


A ~ 


(—poyo- Pp ¥y—m)w* = 0. 


We commute w* by y*W* = W*y" and then multiply the whole equation on the right 


by y°; since yy’ = — y’y, we have in terms of a new bispinor 
P= pry, w= by (21.8) 
the result 
W(yp + m)=0. (21.9) 


As in (20.11), the operator p is here taken to act on the function to its left. The 
function & is called the Dirac conjugate (or relativistically conjugate) function to w. 
The factor y° in its definition signifies that (in the spinor representation) it 
interchanges the spinors €* and *; thus, in & = (y*, é*) the first spinor is undotted 
(as in &) and the second is dotted. For this reason # is a more natural “‘partner”’ of 
yw than wW* is; they appear together, for instance, in various bilinear combinations 
(see §28). 
The inversion transformation for the wave function may be written as 


P: poiy’t, b= —iy’®. (21.10) 


In the spinor representation of w, the matrix y° interchanges the components € and 
ym, as Should happen on inversion. The invariance of Dirac’s equation under the 
transformation (21.10) in the general case is immediately obvious: changing p into 
—p and w into iy°w in equation (21.2), we have 


(Boy’ +p: y—m)yb = 0. 


Multiplying this equation on the left by y° and taking into account the fact that y” and y 
anticommute, we return to the original equation. 
+ These equations may be written jointly in the form 
A 


+ 0.2.0 
6 cami: Se Ea Ae 


QE4-G 
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_ Multiplying the equation (yp —m)=0 on the left by y, and the equation 
W(yp + m)=0 on the right by w, and adding, we obtain 


bry" (pith) + (Buh)y"h = (prby"b) = 0, 


where the parentheses indicate the function on which the operator p acts. This 
equation is in the form of an equation of continuity, 0,j* = 0, so that 


j* = py" 
= (bp, W*yiyy) (21.11) 
is the particle current density 4-vector. Its time component j°= w*u is positive- 
definite. 


Dirac’s equation may be put in the form of an expression for the time 
derivative: 


i aw/at = Hy, (21.12) 


where H is the Hamiltonian of the particle.t To obtain this form, we need only 
multiply equation (21.2) on the left by y°. The resulting expression for the 
Hamiltonian is 


A 


H =a-p+ Bm, (21.13) 
where 
a=y'y, B=y° (21.14) 


is the customary notation for the matrices concerned. 
It may be noted that 


Qi, + A.A; = 265i, Bat+ap =0, Bo= 1, (21.15) 


i.e. all the matrices a, B anticommute and their squares are unity; they are all 
Hermitian. In the spinor representation, 


a= (5 -) B=(; 5) (21.16) 


In the limit of small velocities the particle must be described, as in the 
non-relativistic theory, by a single two-component spinor: on taking the limit p— 0, 
€—>m in equations (20.5), we find € = n, so that the two spinors which form the 
bispinor are equal. This, however, reveals a defect of the spinor form of Dirac’s 


+ For a particle with spin zero, the wave equation was not capable of being written in this form: the 
equation (10.5) for the scalar w is of the second order in the time, while the first-order equations (10.4) 
for the five-component quantity (W, w,) contain the time derivatives of only some of the components. 
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equation: in the limit, all four components of Ww are non-zero, although only two of 

them are really independent. A more convenient representation of the wave 

function Ww would be one in which two of its components were zero in the limit. 
Accordingly, we replace € and 7 by linear combinations ¢ and y: 


(21.17) 
b =alé + 0); X = y75{é — 0). 


Then y = 0 for a particle at rest. This will be called the standard representation of 
yw. On inversion, ¢@ and y are transformed as follows: 


P: ¢-i¢, x > — iy. (21.18) 
The equations for ¢@ and y are obtained by adding and subtracting equations 


(20.5): 


dane | (21.19) 


— Pox + p+ od = mx. 


Hence we see that the standard representation corresponds to the matrices 


reo=) 9 (29) aC etm 


Since the first and second components of € and n are added separately in 
(21.17), the components w, and 3 correspond to the spin component eigenvalue +3 
in both the standard and the spinor representation, and yw and yy to —3. In both 
representations, therefore, the matrix +>, where 


y= (2 ) (21.21) 


in a three-dimensional spin operator: when 33, acts on a bispinor containing only 
the components WW, 3, or wW, YW, this bispinor is multiplied by +) or —3. In an 
arbitrary representation, (21.21) may be written in the form 


= —ay = — tia X OL; (21.22) 


the definition of y° is given in (22.14) below. 


PROBLEMS 


PROBLEM 1. Find the formulae giving the transformations of the wave function under an infinitesi- 
mal Lorentz transformation and an infinitesimal three-dimensional rotation. 
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SOLUTION. In the spinor representation of yw, an infinitesimal Lorentz transformation gives 
é=(1-20-8V)& = (1 +20 8V)n; 

see (18.8), (18.8a), (18.10). These formulae may be combined as 

' = (1-20 + BV). (1) 
Similarly, the transformation under an infinitesimal rotation is 

w= (14+ rid + 0)u. (2) 
In this form the results are valid for any representation of w if @ and & are matrices in that 
representation. 


It is easily verified that the matrices @ and % are the components of an antisymmetric 
4-tensor’’, 


& 


‘matrix 


ot” =Ay"y" — yy") = (a, 1%); 


the components are arranged as shown in (19.15). Using also the infinitesimal antisymmetric tensor 
de”” = (SV, 68), we have 


o””’Se,, = 21% - 60—2a- SV, 
and formulae (1) and (2) above may be combined as 
Wl = (1+ 40"Seuy)u. (3) 


PROBLEM 2. Write Dirac’s equation in a representation such that it contains no imaginary 
coefficients (E. Majorana, 1937). 
SOLUTION. In the standard representation, the only imaginary quantities in the equation 
0 0 0 0 
—+a;,— + ay—+a—Fti =0 
(<. “ox ay az iB) 


are the matrices ay and iB. These may be eliminated by a transformation w'= Uw such that the 
imaginary matrix ay and the real matrix B are interchanged. This is achieved by putting 


U = <5 (ay + B)= U's 


then a, = Uax,U = — a, a= B, a: = — az, B’ = ay, and Dirac’s equation becomes 


in which all the coefficients are real. 


§22. Algebra of Dirac matrices 

In calculations using Dirac’s equation, the matrices y occur repeatedly without 
reference to their specific form in any particular representation. The rules of operation 
with these matrices are entirely given by the commutation relations 


yey ay = Dek i. y = 0,1, 2,3), (22.1) 


which determine all their general properties. 
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In this section we shall give various formulae and rules of the algebra of these 
matrices which are useful in such calculations. 

The “scalar product” of the matrices y with themselves is g,,y"y’ =4. For 
brevity we use the notation y, = g,,y” by analogy with the covariant components 
of 4-vectors. Then 


yay" =4. (2222) 
If the matrices y, and y” are separated by one or more factors y, then they can be 


brought to adjoining positions by one or more interchanges using the rule (22.1), and 
the summation over p is then carried out by means of (22.2). This yields the formulae 


yyy" = —2y", 
Wy yy" = 4g”, 5) 
Wy y’yeyt = —2yPy*y", ) 
9 To a A aie Ab Ailey ae he dan ie 1s ie a 
The factors y", etc., usually appear in combination with various 4-vectors as 
“scalar products” with the latter, 

ya = y"a,. (22.4) 

For such products, formulae (22.1) become 


(ay)(by) + (by)(ay) = 2(ab), | 


(ay)(ay) = a’, Ce 
and formulae (22.3) become 
y.lay)y" = —2(ay), 
Yulay)by)y" = 4(ab), 
(22.6) 


Ylay)by)lcy)y* = —2(cy)(by (ay), 
Ylay)by)(cy)(dy)y" = 2[(dy)(ay)(by (cy) + (cy)(by (ay )(dy)]. 


A frequent operation is taking the trace of the product of a number of 
matrices y. Let us consider the quantities 


T Mika Bn se ! tr (vty... yen), (22.7) 


On account of a familiar property of the trace of a product of matrices, this tensor 
is symmetrical with respect to cyclic permutations of the indices py, p2,..-, Mn 


+ In this edition, no special notation is used for such products. Letters with circumflexes or with strokes 
through them are often found with this meaning in the literature. 
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Since the matrices y have the same form in any frame of reference, the 
quantities T are also independent of this frame, and they therefore form a tensor 
which can be expressed entirely in terms of the metric tensor g,,, which has this 
property. 

From the tensor g,, of rank two, however, only tensors of even rank can be 
constructed. Hence it follows immediately that the trace of the product of any odd 
number of factors y is zero. In particular, the trace of each y is zero:T 


try" =0. (22.8) 


The trace of a unit four-rowed matrix (which is implied on the right-hand side of 
the commutation rule (22.1)) is 4. Thus, if we take the trace of both sides of (22.1), 
we find 


TY’ = gi” (22.9) 
The trace of the four-matrix product is 
T MY = gig? — gitgiP 4 giPgh, (22.10) 


This formula may be derived, for instance, by “‘pulling’’ the factor y* in tr y“y“y’y? 
to the right by means of the relation (22.1); after each interchange one of the terms 
in (22.10) appears: 


T Mev an Pe ala vp __ Tare 
= 22g" me A cd 
and so on. After all the interchanges there remains on the right -T””* = —T**’”, 


which we take to the left-hand side. The trace of a product of six y can similarly be 
reduced to the traces of four-factor products, and so on. For instance, 


T huvpoT — ge T vpoT __ oper He ee pee aa dl as + oe Ler. (22.11) 


All the traces T*”: are real, and they are non-zero only if each of the matrices 
y°’, y',... appears in the product an even number of times; both these results are 
obvious from the above formulae. Hence we easily find that the trace is unchanged 
when the order of the factors is reversed: 


Ta PF me TP BA (22.12) 


As already mentioned, the factors y usually appear as “scalar” products with 
various 4-vectors. In such cases, formulae (22.9) and (22.10), for example, become 
4 tr(ay)(by) = ab, 


(22.13) 
4 tr(ay)(by (cy (dy) = (ab )(cd) — (ac)(bd) + (ad)j(bc). 


+ The trace of a matrix is invariant under the transformations y = UyU™'. Thus the result (22.8) is 
also evident from the expressions (21.3) for the matrices. 
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The product y°y'y’y’ is of particular importance. There is a special notation for 


it which is customarily used: 
P= yey aay (22.14) 
It is easily seen that 
yytty'y=0, (WyY=l], (22.15) 


i.e. the matrix y° anticommutes with all the y”. For the matrices « and 8, the rules 
are 


ay —ya=0, By'+yB =0; ee18) 


the commutability with a follows because a= yy is a product of two matrices y". 
The matrix y° is Hermitian, since 


34+ 2+ 14+ 0+ i De eee Be 


yr siyty yy = ivy yy’, 


and hence 
yay’, (22.17) 


because the sequence 3210 is changed to 0123 by an even number of transpositions. 
The form of this matrix in two particular representations is: 


spinor = . 7 
° : one. (22.18) 


0 —-!1 
5 os 
Standard y & Al) 
The trace of the matrix y° is zero: 


tr y°=0, (22.19) 


as can be seen directly from (22.18). The traces of the products y’y"y” are also 
zero. For the products of y° with four factors y” we have 


itr yy *y"y’y? = ie”. (22.20) 
Another formula is 
yN = iy*(ya)(yb)(yc), NP Sie" abies (22.21) 
which is valid for mutually normal 4-vectors a, b, c: 


ab = ac = bc = 0. 
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In some cases (for problems involving non-relativistic particles), it may be 
necessary to calculate the traces of products which involve y° and the three- 
dimensional “‘vector”’ y separately. The only non-zero traces are those of products 
containing even numbers of factors y’ and y. All the factors y® become unity, and 
the traces of products with two and four factors y are respectively 


i tr(a: y)(b+y)= —a°b, 


str(a+ y)(b+ y)(e + y)(d + y) = (a: b)(e- d)— (a+ c)(b: d)+ (a: d)(b- ce). 
(22.22) 


§23. Plane waves 


The state of a free particle having definite values of the momentum and energy 
is described by a plane wave which may be written in the form 


I * —ipx 
Wp = A /(2e) Ul : (23.1) 


The suffix p indicates the value of the 4-momentum; the wave amplitude u, is a 
suitably normalized bispinor. In proceeding with second quantization we need 
not only the wave functions (23.1) but also functions with a “negative frequency’, 
which arise in the relativistic theory because of the two-valuedness of the square 
root +V(p?+ m7), as shown in §11. As in §11, we shall always take ¢ to be the 
positive quantity «=+V(p’+m’), so that the “negative frequency” is —e; on 
changing also the sign of p, we obtain a function which may naturally be called w_,: 


a 3) ue. (23.2) 


The significance of these functions will be explained in §26; here we shall write 
parallel formulae for w%, and w_,. 
The components of the bispinor amplitudes u, and u_, satisfy the algebraic 
equations 
(yp —m)u, = 0, 
: (23.3) 


(yp +m)u_, = 0,. 
which are obtained by substituting (23.1), (23.2) in Dirac’s equation (this is 


equivalent to replacing the operator p in that equation by +p).t The relation 
p’ =m?’ is then the condition for each such pair of equations to be compatible. We 


+ There are also similar equations obtained from Dirac’s equation (21.9) for the complex-conjugate 
function: 


up(yp-m)=0, u-p(ypt+m)=0. (23.3a) 
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shall always normalize the bispinor amplitudes by the invariant conditions 


Upu,y = 2m, 
a (23.4) 


U_pu_p = —2m, 


where the bar over a letter denotes, as usual, Dirac conjugation: ai = u*y°. Multiplying 
equations (23.3) on the left by a.,, we obtain (i.,yu.,)p = 2m’ = 2p’, whence 


Upyu, = U_pyu_p, = 2p. (23.5) 


It may be noted that the change from the formulae for u, to those for u_, is made 
by changing the sign of m. 
The current density 4-vector is 


are ae ies 
j= Wp YWxp = ae UspYusp = ple, (23.6) 


i.e. J” =(1,v), where v=p/e is the velocity of the particle. Hence we see that the 
functions y, are “normalized to one particle in the volume V = 1”’. 

Equations (23.3) show that the components of the wave amplitude are related, 
but the actual form of the relations depends, of course, on the specific represen- 
tation of w. For the standard representation they are found as follows. 

From equations (21.19) we have, for a plane wave, 


(e—-m)h —p: ox = 0, 
(23.7) 
(e+m)y —p:od = 0. 
From these we find the relation between ¢ and y in two equivalent forms: 
_ pg ee ee 
earners « = ees (23.8) 


their equivalence is evident on multiplying the first form on the left by p: o/(e +m) 
and using the results (p- o)’= p’ and e*— m’=p’, which gives the second form. 
The common factor in ¢@ and y is chosen to satisfy the normalization condition 
(23.4). Thus we obtain for u,, and correspondingly for u_,, the expressions 


= V(e+m)w _ V(e-—m)n-o)w! 
a (Vie —m)(n- aw) — toe +m)w' ) (23.9) 


the second formula is obtained from the first by changing the sign of m and 
replacing w by (n- o)w’. Here n is a unit vector in the direction of p, and w ts an 


arbitrary two-component quantity subject only to the normalization condition 


w*w =|. (23.10) 
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For a = u*y° (with y° from (21.20)) we have 


ip = (V(e + m)w*,-V(e — m)w*(n- o)), | 


i_p =(V(e —m)w'*(n- a), -— V(e + m)w'*), ee") 
and multiplication shows that in fact i.,u+) = +2m. 
In the rest frame (e = m), we have 
w 0 
u, = Veam)(9 ), u_p = V2m) @) (23.12) 


i.e. w is the three-dimensional spinor to which the amplitude of each wave reduces 
in the non-relativistic limit. In the bispinor u_,, the first two components, not the 
second two, vanish in the rest frame. This property of solutions of Dirac’s equation 
having “‘negative frequencies”’ 1s evident, since by putting p = 0 and replacing « by 
—m in (23.7), we find ¢@ = 0.T 

The amplitude of the plane wave contains one arbitrary two-component quan- 
tity. Thus, for a given momentum, there are two different independent states, 
corresponding to the two possible values of the spin component. But the spin 
component along an arbitrary z-axis cannot have a definite value. This is evident 
because the Hamiltonian of a particle with definite p (i.e. the matrix H =a:p+ 
Bm) does not commute with the matrix {, = —ia,a,y. In accordance with the general 
conclusions of $16, however, the helicity A (the component of the spin in the 
direction of p) is conserved: the Hamiltonian commutes with the matrix n> &. 

Helicity states correspond to plane waves in which the three-dimensional spinor 
w = wn) is an eigenfunction of the operator n- o: 


s(n : o)w = hw). (23.13) 


The explicit form of these spinors is 
12 
werd = ee cos -) 
e2* sin3@ /” 


lig ° 1 
word = & 2” SIN ) 2 
_ 1. ; 
e** cos 36 /’ 


(23.14) 


where @ and @ are the polar angle and the azimuth of the direction of n relative to 
fixed axes xyz.t 

Another possible choice of the two independent states of a free particle with 
given p, which is simpler but less clear, corresponds to the two values of the 
z-component of the spin in the rest frame, which we denote by o. The spinors are 


1 1 aot 0 
(o=7) — (o=-)) — 
w Gy w (;). (23.15) 


+ In the spinor representation € = — y, instead of €=7 as in the rest frame for solutions having 
“positive frequencies”. 

t The solution of equation (23.13) can be multiplied by any phase factor, because of the possibility 
of an arbitrary rotation about the direction of n. 
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As the two linearly independent solutions with “negative frequency” we take plane 
waves in which the three-dimensional spinors are 


w= — gw”? = 2aiw™; (23.16) 
the significance of this choice will be shown in §26. 

We can also find a representation of a plane wave such that in any frame of 
reference (not only in the rest frame) the wave has only two components cor- 
responding to definite values of the same physical property—the spin component in 
the rest frame (L. Foldy and S. A. Wouthuysen, 1950). 

Starting from the amplitude u, (23.9) in the standard representation, we seek a 
unitary transformation to such a representation in the form 


u} = Uu,, Use", 


where W is real; since y* = —y, it follows that U* = U™'. Expanding in series and 
noting that (y - n)? = —1, we put 


U=cosW+t+y-nsin W; 


cf. the derivation of (18.14) from (18.13). The condition that the second pair of 
components in the transformed amplitude u, should be zero gives 


tan W = |p|/(m + «), 
so that 
U _mt+et(y-n)pl 
[2e(e +m)] 
In the new representation, 


ws = Ve) (i). (23.17) 


The Hamiltonian of the particle in this representation is 
H'=U(a:p+Bm)U'= Be, (23.18) 


where all the matrices B,a and y belong to the standard representation. This 
Hamiltonian commutes with the matrix 


which is, in the new representation, the operator of a conserved quantity, the spin 
in the rest frame. 
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§24. Spherical waves 
The wave functions of states of a free particle (with spin 3) having definite 
values j of the angular momentum are spinor spherical waves. To determine their 


form, let us first state the corresponding formulae of the non-relativistic theory. 
The non-relativistic wave function is a three-dimensional spinor 


For a state having definite values of the energy e (and therefore of the momentum? 
p), the orbital angular momentum I, the total angular momentum j and its com- 
ponent m, the wave function is 


wy m Rp (1) Qjm (8, b). (24.1) 


The angular functions Q;,, are three-dimensional spinors whose components (for 
the two values j = | +34 which are possible for a given 1) are 


Oust im = 


(24.2) 


H+2 chee, 


(see QM, $106, Problem). We shall call the OQjm spherical harmonic spinors. They 
are normalized by the condition 


| 2% in Qprm do = 5,60 Omm'- (24.3) 


The radial functions R,,; are the common factor in the two components of the 
spinor Ww, and are given by 


2 
Rp = a Jist(pr) (24.4) 


(QM, (33.10)). They are normalized by the condition 


| r’RpiR, dr = 2778(p'— p). (24.5) 
0 


+ In this section, p denotes |p]. 
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Returning now to the relativistic case, let us note first of all that separate laws 
of conservation of spin and orbital angular momentum do not exist for a moving 
particle: the operators § and I do not separately commute with the Hamiltonian. 
But the parity of the state is still conserved (for a free particle). The quantum 
number | therefore no longer refers to a definite value of the orbital angular 
momentum, but it defines the parity of the state (see below). 

Let us consider the required wave function (bispinor) in the standard represen- 
tation: 


Under rotations, @ and y behave like three-dimensional spinors. Their angular 
dependence is therefore given by the same spherical harmonic spinors Qjm. Let 
gh  Qim, Where | is a certain one of the two values j +5 and j —}. Under inversion 
o(r) > id(—r) (see (21.18)), and Ojim(—n) = (—1)'Qjim(n), so that 


b(r) > i(—-1)'o(r). 


The components y(r), under inversion, become —iy(—r). In order that the state 
should have a definite parity (i.e. that all the components should be multiplied by 
the same factor on inversion), it is therefore necessary that the angular dependence 
in x should be given by the spherical harmonic spinor Qjm with the other of the 
two possible values of 1; since these two values differ by 1, (-1)' = —(-1)’. 

The radial dependence of ¢@ and y will be given by the same functions R,; and 
Ry (with the values of | and I’ which give the order of the spherical harmonics in 
Qjm). This is clear because each component of wW satisfies the second-order 
equation (p* — m’)# = 0, which for a given value of |p| becomes 


(A + p’)y =0, 
and this is formally identical with Schrodinger’s non-relativistic equation for a free 
particle. 
Thus 


db ma AR, Qym, x= BRp Qyrm; (24.6) 
and it remains to determine the constant coefficients A and B. To do so, we 


consider a distant region, where the spherical wave may be regarded as a plane 
wave. According to the asymptotic formula (QM, (33.12)), 


Roi ae + {ei tn) “2 eiPr-zml) (24.7) 


so that @ is the difference of two plane waves propagated in the directions 
+n (n=r/r). For each plane wave, by (23.8), 


ae ; 
x eh oom a). 
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From the previous results (formulae (24.6)) it is obvious that (n+ W)OQjm = AQjrm, 
where a is a constant. This constant is easily found by comparing the values of the 
two sides of the equation when m = 3 and nis along the z-axis. Using (7.2a), we find 


(n: F)Q jim i" Ojrm- (24.8) 


These formulae, on comparison with (24.6), show that 


Finally, the coefficient A is determined by the normalization of w. If this ts 
specified by 


| We imBpjum AX = 2778451Smm'5(p — p'), (24.9) 


we have 


= 1 Ve ae M)Rp Qim i ys 
Wojim = V(2e) ee _ eae l'=2j-l. (24.10) 

Thus, for given values of j and m (and of the energy e) there exist two states 
differing in parity. The parity is uniquely defined by the number |, which takes the 
values j+ 3: on inversion, the bispinor (24.10) is multiplied by i(—1)'. The com- 
ponents of this bispinor, however, contain spherical harmonics of both orders | and 
l’, showing that the orbital angular momentum has no definite value. 

When r—~, the spherical waves (24.7) may be regarded as plane waves in any 
small region of space, with momentum p= + pn. It is therefore clear that the wave 
functions in the momentum representation differ from (24.10) essentially only in 
that the radial factors are absent and n denotes the direction of the momentum. 

In order to make a direct change to the momentum representation, we must 
carry out a Fourier transformation: 


(p’) = | w(r) e'? '" d’x. (24.11) 


The integral is calculated by means of the expansion of a plane wave in spherical 
Waves: 


sie ame Ee 
err 2ZS SY ERWDY (?) Yin (=). (24.12) 


p 1=0 m=— 


With an expansion of this kind for e~'? "" in (24.11) and using (24.5), we find that the 
Fourier components of the function 


w(r) = Ry (rn) Qyim (r/r) 
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are 


vip) = 22 3(p'—p)i Ym (B) | Qim(E) Yir(E) ao 


The integral is equal to the coefficient of the spherical harmonic function in the 
definition (24.2) of the spherical harmonic spinors, and together with the factor 
Yim'(p'/p') it yields the same spherical harmonic spinor, but with argument p’/p’: 


7 


b(p') = 8(p'— p)i ‘Qum (E,). 


p 


Applying this result to the bispinor wave function (24.10), we obtain the 
momentum representation 


» = p(y" py 2M (We + mi Qin (D/P 
iO) = 80" =P) oy (Ve _ ree) (24.13) 


The states |pjlm) are the same as the states |pjm|A|) (with |A| = 3) discussed in 
$16: both have definite values of pjm and the parity. The spherical harmonic spinors 
Qjim are therefore related in a certain way to the functions DY, (both with argument 
p/p). When p > 0, the wave functions (24.13) reduce to the three-dimensional spinors 
Qi, the parity of which is P = n(—1)' (where 7 =i is the “internal parity” of the 
spinor). A comparison with the results of $16 gives the formula 


Qvitm =i itt (wD, = w® D1®) (24.14) 


where | = j +3, and the w™ are the three-dimensional spinors (23.14). 


§25. The relation between the spin and the statistics 


The second quantization of a field of particles with spin 3 (a spinor field) is 
carried out in a similar way to that of a scalar field in §11. 

Without repeating the arguments, we shall immediately write down expressions 
for the field operators, which are exactly analogous to (11.2): 


, l A —ipx ak ipx 
Y= 21 7G) (oo Une aac: ge | See a 
) (25.1) 
Pie ee 0 __ 1 At = ipx fo —ipx \. 
v=ry = (ap) Gowityoe Dae ewe «Py. 
p.c 


the summation is over all values of the momentum p and over o = +3. The 
antiparticle annihilation operators b,, (like the particle annihilation operators 4),) 
appear as the coefficients of functions whose coordinate dependence (e'?’‘) 
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corresponds to a state having momentum p.t 

To calculate the Hamiltonian of the spinor field, it is not necessary to determine 
the energy-momentum tensor (as we did for the scalar field), since in this case 
there exists a particle Hamiltonian which can be used to derive the wave equation 
(Dirac’s equation) (21.12). The mean energy of the particle in a state with wave 
function w is the integral 


[ w*Htya's =i | uct d°x 
Si | byt ax. (25.2) 


It should be noticed that the “energy density” (the integrand) is here not a 
positive—definite quantity. 

Replacing the functions » and w in (25.2) by w-operators, using the ortho- 
gonality of the wave functions with different p or o, and also using the relation 
spo’ Uspe =2e for the wave amplitudes, we obtain the field Hamiltonian in the 
form 


H => ¢(d;,4,, — 6,,b+,). (25.3) 


Hence it is seen that in this case Fermi quantization must be used: 
lanGe eed, Adee bi dead, (25.4) 


and all other pairs of operators 4, d*, b, b* anticommute (see QM, 865), since then 
the Hamiltonian (25.3) may be written 


H => ¢(a,4,. + bt,b,. — 1), 
P,o 


and the energy eigenvalues are (with the usual omission of an infinite additive 
constant) 
E =D) e(Noc + Noa): (25.5) 
p.7 


and are positive—definite, as they should be. With Bose quantization, we should 
obtain from (25.3) the eigenvalues 


YS. e(Npo — Noo)s 


which are not positive—definite and have no meaning. 


} The two functions also correspond to the. same value o of the spin component in the rest frame; 
for the functions w_p_—. this will be proved in §26 (see (26.10)). 
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An expression analogous to (25.5) is obtained for the momentum of the system, 


aA na A 


i.c. the eigenvalues of the operator f ys* pw d°x: 
P= > p(Noo + N po): (25.6) 
p,g 


The 4-current operator is 


j* = hyd, (25.7) 


and the “‘charge”’ operator of the field is found to be 


p, ~ 
= 2 (Apcdpo — bt Doo ts 1); (25.8) 
its eigenvalues are 
Q => (Noo — Nyc). (25.9) 
p, 7 


Thus we again arrive at the concept of particles and antiparticles, and the whole 
of the discussion of these in §11 is applicable. 

But particles with spin 3 are fermions, whereas those with spin zero are bosons. 
An examination of the formal origin of this difference shows that it is due to the 
different nature of the expressions for the “energy density” in the scalar and spinor 
fields. In the scalar field the expression is positive—definite, and the terms d@"d and 
bb* therefore both have a positive sign in the Hamiltonian (11.3). If the energy 
eigenvalues are positive, the replacement of bb* by b*b must occur without change 
of sign, i.e. in accordance with the Bose commutation rule. For the spinor field, 
however, the “energy density” is not a positive-definite quantity, and hence the 
term bb* appears with the minus sign in the Hamiltonian (25.3); to obtain positive 
eigenvalues, the replacement of bb* must be accompanied by a change of sign, i.e. 
must occur in accordance with the Fermi commutation rule. 

The form of the energy density is directly related to the transformation 
properties of the wave function and to the requirements of relativistic invariance. 
In this sense we may say that the relation between the spin and the statistics 
obeyed by the particles is likewise a direct consequence of these requirements. 

The fact that particles with spin 3 are fermions also leads to a general con- 
clusion: all particles with half-integral spin are fermions, and those with integral 
spin are bosons (including those with spin zero, as demonstrated in §11).7 


+ The origin of the relation between the spin of a particle and the statistics which it obeys was first 
elucidated by W. Pauli (1940). 


QE4-=H 
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This is evident because a particle with spin s may be regarded as “‘composed”’ 
of 2s particles with spin 3. When s is half-integral, 2s is odd; when s is integral, 2s 
is even. A “composite” particle containing an even number of fermions is a boson, 
and one containing an odd number of fermions is a fermion.f 

If a system consists of particles of various kinds, then creation and annihilation 
Operators must be defined separately for each kind of particle. The operators 
pertaining to different bosons, or to bosons and fermions, commute. Operators 
pertaining to different fermions may be regarded, in the non-relativistic theory, as 
either commuting or anticommuting (QM, 865). In the relativistic theory, which 
allows transformations of particles into one another, the creation and annihilation 
operators of different fermions must be regarded as anticommuting, like those 
which pertain to different states of the same fermions. 


PROBLEM 


Find the Lagrangian of the spinor field. 


SOLUTION. The Lagrangian corresponding to Dirac’s equation is given by the real scalar expression 
Were = = 
L =2( hy" duh — dua --y"ys) — mip. (1) 


Taking the components of w and wW as the ‘generalized coordinates’? g, we easily see that the 
corresponding Lagrange’s equations (10.10) are the same as Dirac’s equations for & and w. The overall 
sign of the Lagrangian (like the common factor in it) is arbitrary. Since L involves the derivatives of 
and w linearly, the action S = f L d*x can in any case have no minimum or maximum. The condition 
5S = 0 here gives only a stationary point of the integral, not an extremum. . 

The Lagrangian of the spinor field is obtained by replacing W in (1) by the operator W. Applying 
formula (12.12) to this Lagrangian, we find the current operator (25.7). 


§26. Charge conjugation and time reversal of spinors 


The coefficients Wf). = Upoe ’* which appear with the operators d,, in (25.1) are 
the wave functions of free particles (electrons, say) having momenta p and 
polarizations o: 


(e) 
pao po 


The coefficients &_,,-, of the operators Doo are to be regarded as the wave functions 
of positrons having the same p and a. It is found, however, that the electron and 
positron functions are expressed in different bispinor representations. This is 
evident from the fact that » and w differ in their transformation properties and 
their components satisfy different sets of equations. To eliminate this defect, it is 
necessary to carry out a certain unitary transformation of the components w_,_,, 


+ In this argument it is assumed that all particles with the same spin obey the same statistics 
(whatever the way in which they are. ‘‘compounded”’). The truth of this assumption is seen by 
analogous arguments. For example, if there existed fermions with spin zero, then a fermion with spin 
zero and one with spin 3 would yield a particle with spin 1, which would be a boson, in contradiction with 
the general result demonstrated for spin 3. 
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such that the new four-component function satisfies the same equation as y,,.t This 
will be referred to as the wave function of the positron (with momentum p and 
polarization ao). Denoting the matrix of the required unitary transformation by Uc, 
we may write 


OS UCN jue: (26.1) 


The operation C whereby this function is obtained from wW_,_, is called charge 
conjugation of the wave function (H. A. Kramers, 1937). This concept is, of 
course, not restricted to its application to plane waves: for any function w, there 
exists a “charge-conjugate”’ function 


Cuw(t, r) = Ucih(t, r), (26.2) 


which has the same transformation properties as w and satisfies the same equation. 
The properties of the matrix Uc follow from this definition. If & is a solution of 
Dirac’s equation (yp — m)p = 0, then w satisfies the equation 


W(yP+m)=0, or (Fp +m)p =O. 
Multiplying this equation on the left by Uc: 
Ucypi + mUcts = 0, 
we apply the condition that the function Ucw satisfies the same equation as w: 
(yp — m)Uct = 0. 


A comparison of the two equations gives the following “‘commutation relation” 
between Uc and the matrices y":4 


Uch" =—y"Uc. (26.3) 


We shall further suppose that the wave functions are stated in the spinor or 
standard representation; the general case of any representation will be considered 
only at the end of this section. In these representations, we have 


0,2 ~0,2 1,3 _ 1,3 
Y =7Y 5 Y 7 Y ’ 


(-y!3)* = yobs. yr* = —y?. (26.4) 


Then the condition (26.3) is satisfied by the matrix Uc = yncy’y’, the constant nc 
being arbitrary. The condition C?=1 shows that |nc|?=1, and the matrix Uc is 

+ For particles with spin zero, this problem did not arise, since the scalar functions W and ys* satisfy the 
same equation, and W*, is identical with wp. 


+ From this there follows also the equation 


Ucy’ = y Uc. (26.3a) 
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therefore determined apart from a phase factor. We shall take nc = 1; thus 
Uc =y’y° = -ay. (26.5) 


Noting also that & = b* y= 7°u* = y°*, we may write the effect of the operator C 
as 


Ch = yh = y7b*. (26.6) 
The explicit form of the transformation (26.6) for the spinor representation is 
C: &>-in™, Ne —1€%, (26.7a) 
or, equivalently, 
C: & ings 9 nt >—ié™*. (26.7b) 
The charge-conjugation transformation for the plane waves wz», 1S easily 


carried out by using the explicit expressions (23.9) for the plane waves and the 
matrix U¢ in the standard representation: 


_({ 9 ~ Oy 
Uc = es ‘ ) (26.8) 
Since 
aya* = — Goy, 


we have, if w‘”” is defined as in (23.16), 
O Cl p25 = Uys Ucu_p—o = Upe- (26.9) 
Thus 
Chas (26.10) 


so that the functions W_,-, which appear with the operators Doo in the w-operators 
(25.1) do in fact correspond to states of a particle having momentum p and 
polarization o. We see also that the electron and positron states are described by 
the same functions: 


(e) — f,(p) — 
pao” be = Woo 


This is to be expected, since the functions &,, embody information only as to the 
momentum and polarization of the particle. 

The operation of time reversal may be treated similarly. When the sign of the 
time is changed, the wave function must change to the complex conjugate. In order 
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to obtain the “‘time-reversed”’ wave function (Tw) in the same representation as the 
original , we must also perform some unitary transformation on the components 
of w* (or &). Thus the action of the operator T on w can be written, similarly to 
(26.2), as 


T(t, r) = Urp(-t, v), (26.11) 


where U7 is a unitary matrix. 
Dirac’s equation satisfied by w is 


(ins + iy: V- m \ w(t r) = 0, 

and the equation for uJ is 
ae ee . 

(is ae iy: V+ m) dt, r)=0. 

In the latter equation we change t into —t and multiply on the left by —Ur: 
(iv19" = —iUry: v) a(t r) — mU7h(—t, R) = 0. 
We want the function Uris(-t, r) to satisfy the same equation as w(t, r): 
Gar + iy: v) Uris(-t, r) — mU7(-t, r) = 0. 


Comparing the two equations, we find that the matrix U; must satisfy the 
conditions 


Ury°=yUr, 9 Ury =—yUr. (26.12) 


In the spinor and standard representations, these conditions are satisfied by the 
matrixt 


Ur =iyy'y®. (26.13) 
Thus the effect of the operator T is given by 
Tw(t,r) = iy’y' yw, 1) = iy’ y'y*t v). (26.14) 
The explicit form of this transformation for the spinor representation Is 
T: &*>—-ié%, Ng in, (26. 15a) 


+ The choice of the phase factor in (26.13) depends on that in (26.5); see the second footnote to §27. 
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or 
T: &,7i€*, ni —in;. (26.15b) 


In the standard representation, 


le @ : ) (26.16) 


O- -=0, 
To find the effect on w of all three operations P, T and C, we write successively 


Twit, r) = —iy'y'y*(-t,n), 
PT w(t, r) = iy (Tw) = y°y'y°e*(-t, -4), 
CPT Wit, r= y(y°y'yiw*)* = yyy y w(t, — 4), 
or 


CPT Wit, r) = iy’w(-t, — 1). (26.17) 
In the spinor representation, 
CPT: €°> =I, Ni Na, (26.18) 


as is also easily seen directly from the transformation rules (20.4), (26.7) and 
(26.15).t 

The expressions given above for the matrices Uc and Ur assume the spinor or 
standard representation of w. Let us finally see which properties of these expres- 
sions are retained for any representation of w. 

If & is subjected to a unitary transformation: 


y= Ub, y= UYU", yl = py" = pU* = gu, (26.19) 
then in the new representation we have 
(Cy) = U(Cp) = UU cb = UUc(p'U) = UU UY". 
A comparison with the definition of the matrix U¢ in the new representation, 
(Cyw)’ = U cw’, shows that 


Uc= UUcU (26.20) 


The transformation (26.20) is the same as that of the matrices y only if U is real. 


The expression (26.5) also is therefore valid only in representations which are a real 
transformation of the spinor or standard representation. 


+ The notation CPT implies that the operators.act in the sequence from right to left. The sign of (26.17) 
and (26.18) depends on this sequence, since T does not commute with C and P (as regards their action ona 
bispinor). 
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The matrix (26.5) is unitary, and changes sign when transposed: 
UUt=1, Uc=—-Uce (26.21) 


These properties are unaffected by the transformation (26.20), and are therefore 
retained in any representation. The matrix (26.5) is also Hermitian (Uc = UO), but 
this property is in general not preserved by the transformation (26.20). 

The above discussion and formulae (26.21) apply likewise to the properties of 
the matrix Ur. 

In the second quantization formalism, the transformations C, P. T for w-operators 
must be formulated as transformation rules for the particle creation and annihilation 
operators. These rules can be established (as in $13 for particles with spin zero) from 
the condition that the transformed w-operators may be written 


w(t, r)= Ucih(t, r), 
WP (t, r) = iy W(t, -r), (26.22) 


w(t, r) = Un)(-t, r). 


PROBLEM 


Find the charge-conjugation operator in the Majorana representation (§21, Problem 2). 


SOLUTION. The matrix U cin the Majorana representation is obtained from the matrix Uc = — ay inthe 
standard representation by the transformation (26.20) with U = (a, + B)/V2, and is Ué= —ay (where ay 
and B denote matrices in the standard representation). If primes denote quantities in the Majorana 
representation, we have Cw’ = U ((w'*B’'), and since B’ = ay, 


Cw’ oe aty (f'* ary ) = ayayyy’* = p'*, 


i.e. charge conjugation is the same as complex conjugation. 


§27. Internal symmetry of particles and antiparticles 


The wave function of a particle with spin 3, in its rest frame, is a single 
three-dimensional spinor, which will be denoted by ®*. The behaviour of this 
spinor under inversion is related to the concept of the internal parity of the particle. 
However, as mentioned in $19, although the two possible laws of transformation of 
three-dimensional spinors (®* > +i®*%) are not equivalent, there is no absolute 
significance in assigning a particular parity to a spinor. We therefore cannot speak 
of the internal parity of any one particle with spin 5, but we can refer to the relative 
internal parity of two such particles. 

From two (three-dimensional) spinors ®” and ®®, a scalar @Y'O°* can be 
formed. If this is a true scalar, the particles described by the spinors are said to 
have the same internal parity; if it is a pseudoscalar, they are said to have opposite 
internal parities. 

We shall show that particles and antiparticles (with spin 3) have opposite parities 
(V. B. Berestetskii, 1948). 

Firstly, if the operation C (26.7) is applied to both sides of the P transformation 
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(19.5) (in the spinor representation) 

P: &*> ing, Na 1E%, (27.1) 
we obtain 

BME, EL in, 


where the index c marks the components of the bispinor 


Wo = es charge-conjugateto p= ) 


Taking the complex conjugate and interchanging the indices, we find 
P: g7ié™, ES > ing. (27.2) 


Thus charge-conjugate bispinors are transformed in the same manner by inversion. 

Let be the wave function of a particle (say an electron) and & that of the 
antiparticle (a positron). The latter is a bispinor which is the charge conjugate of a 
‘“negative-frequency” solution of Dirac’s equation. In the rest frame, each function 
becomes a three-dimensional spinor: 


Ee = 7) = Por EP) QP) = HO 
According to (27.1), (27.2), these spinors are transformed as follows by inversion: 
p* > iP’, (27.3) 


the same for both ®® and ®”. The product 0”, however, changes sign, and 
this proves the result stated above. 

A strictly neutral particle is one which coincides with its antiparticle ($12). The 
w-operator of a field of such particles satisfies the condition 


w(t, r) = C(t, r). 


For particles with spin 3, this implies the conditions (in the spinor representation)t 


A 


é*= -if**, 9 ha = — i€2. (27.4) 


Like any relation which expresses a physical property, these conditions are 
invariant under the transformation CPT.¢ It is easily verified that they are in fact 


+ In the Majorana representation, strict neutrality implies simply that the operator #' is Hermitian; 
see §26, Problem. 

t More precisely, the transformation CPT must here be defined so as to leave relations such as 
(27.4) unchanged. This is achieved by an appropriate choice of the phase factor in the definition of the 
matrix Ur; see the third footnote to §26. 
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invariant not only with respect to CPT but also with respect to each of the three 
transformations separately. 

In §19, inversion of spinors was defined to be a transformation for which 
P*=~—1, and this definition has been used so far. The result derived above 
concerning the relative parity of particles and antiparticles is easily seen to be, as it 
should be, independent of the way in which inversion is defined. 

If inversion is defined by the condition P? = 1, then (27.1) becomes 


P: & >a Nar e- (27.5) 
The charge-conjugate function is then transformed according to 
Cera: | Wee. 


which differs in sign from (27.5). Accordingly, the three-dimensional spinors ® will 
be transformed thus: 


pee Pe, pre _, — Pe 


and the product ®°'@” will again be a pseudoscalar. 

The only possible difference in the physical consequences of. the two views of 
inversion is that with the definition (27.5) the condition for a strictly neutral field 
would not be invariant under this transformation (or the transformation CP), 
which would alter the relative sign of the sides of the equations (27.4). Actually, no 
strictly neutral particle with spin 7 is known, and we cannot yet say whether this 
difference between the two definitions of inversion has any real physical meaning.t 


PROBLEM 


Find the charge parity of positronium (a hydrogen-like system consisting of an electron and a 
positron). 

SOLUTION. The wave function of two fermions must be antisymmetric with respect to simultaneous 
interchange of coordinates, spins and charge variables of the particles (cf. §13, Problem). The 
interchange of the coordinates multiplies the function by (—1)', that of the spins by (—1)'**, where S (=0 
or 1) is the total spin of the system, and that of the charge variables by the required parity C. The 
condition 


(-10(-1)'C = —| 


gives C = (—1)'**. Since the internal parities of the electron and the positron are opposite, the spatial parity 
of the system is P = (—1)'*'. The combined parity CP = (—1)**'. 


§28. Bilinear forms 


Let us consider the transformation properties of various bilinear forms which 
can be constructed from components of the functions Ww and W*. Such forms are of 
great importance in quantum mechanics. They include the current density 4-vector 
(21.11). 


+ The incomplete equivalence of the two definitions of inversion was first noted by G. Racah (1937). 
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Since w and w* have four components each, a total of 4x 4= 16 independent 
bilinear combinations can be formed from them. The classification of the trans- 
formation properties of these is evident from the ways listed in §19 of multiplying 
any two bispinors (in this case W and W*). We can form a scalar (denoted by S), a 
pseudoscalar (P), a mixed spinor of rank two equivalent to a true 4-vector V" (four 
independent quantities), a mixed spinor of rank two equivalent to a 4-pseudovector A” 
(four quantities), and a bispinor of rank two equivalent to an antisymmetric 4-tensor 
T"” (six quantities). 

In a symmetrical form (for any representation of ys), these combinations may be 
written 


S= hp, P=ibyy, 


: _ _ (28.1) 
Vi=dby'y, At=dy'yd, T= ibo'y, 


where 
ot? =(y"y’ — yy) = (a, id); (28.2) 


the components in (28.2) are stated as in (19.15).f All the expressions given above 
are real. 

The fact that S is a scalar and P a pseudoscalar is evident from their spinor 
representations: 


S=&y+n*& P=il(€*n- 7"), 


which agree with (19.7) and (19.8). The fact that the V" form a vector is then evident 
from Dirac’s equation: multiplying the equation p,y"w = mw on the left by w, we 
obtain 


Uppy = min. 


Since the right-hand side is a scalar, so is the left-hand side. 

The rule whereby the quantities (28.1) are obtained is obvious: they are 
constructed as if the matrices y" formed a 4-vector, y° were a pseudoscalar, and 
the w and on either side together formed a scalar. The non-existence of bilinear 
forms which are symmetrical 4-tensors is seen from the spinor representation and 
also from this rule: since the symmetrical combination of matrices 1s y"y’ + y”y" = 
2g"", any such form would reduce to a scalar. 

The second-quantized bilinear forms are obtained by replacing the w-functions 
in (28.1) by w-operators. For greater generality, we shall assume that the two 
w-operators relate to fields of different particles, denoted by suffixes a and b. Let 


_ + For a unitary transformation of w (change of representation), we have ~> Uw, y>UyU"'," 
w > wU"', and the invariance of the bilinear forms under such transformations is obvious. 
+ The ‘pseudoscalar’ nature of y° is itself in accordance with these rules, since 


5 i ; 
y = 54 Cs yy y’. 
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us see how such operator forms are transformed under charge conjugation. We 
havet 


Uo =Uch, bo = Ubt, (28.3) 


and therefore, using (26.3) and (26.21), 


WSUS = baU BUCH 
= —h,U bU chy 
a oe atl, 
Voy if = aU By"Uchy 
=~ h.Uéy"Uchy 
= eI". 


When the operators are restored to their original order (& to the left of w), the 
Fermi commutation rules (25.4) show that the sign of the product is changed (and 
moreover terms appear which are independent of the state of the field; we omit 
these, as in the corresponding treatment in $13). Thus we have 


abs = Woh, Way" = —dpy"he 


Proceeding similarly with the other forms, we find the results for charge con- 
jugationt 


Cc Sab oe Soas Pu > Pras Ve, ar a Vio 
At,> Ag, TH >—Tes, ane 


The behaviour of these forms under time reversal may be ascertained similarly, 
remembering (see $13) that this operation brings about a change in the order of the 
operators, so that, for example, 


(hebe)” = bh. 


+ To derive the second of these equations from the first, we write 
v= [U&iy™)]" 
= PUEY 
=-YUCy 
=H y"*U cb 
= Utt, 
using (26.3), (26.21) and the fact that y° is Hermitian. 
+t It should be noticed that, for bilinear forms constructed from w-functions (not W-operators), the 


transformations (28.4) would have the opposite signs, since the return to the original order of the factors 
w and Ww would not involve a change of sign. 
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Substituting here 
b'= Urb, Ub" =-UFd, (28.5) 
we obtain 
(Wath) = OU ihe 
= fpUTU tf, 
= Votbo- 
Treating the other forms in the same way, we obtian 


Be Sub => Shas Ps —? —P,,, (Vv°, V)ap 7g. (Gar —V)bas 


ae ae he Be, ike » (28.6) 
(A ’ A)ab a (-A A)bas ‘Ba ia (p, &)ab > (p, ~~ &)ba; 
where p and 4 are three-dimensional vectors equivalent to the components of T”’ 
as shown in (19.15). 
Under spatial inversion we have, in accordance with the tensor properties,t 


P* S4 ed Sa: Ps 2 == Pie (V°, V)ab ae (v°, —V)abs 


path doy 3 (28.7) 

(A®, A)ab > (—A®, A)bas T a = (p, a)ab > (—P, A)an. 
Finally, the simultaneous application of all three operations leaves all the Sub» Pap 
and rey unchanged, and changes the sign of all the V4, and AX: in agreement with 
the fact that this transformation is a 4-inversion: since 4-inversion is equivalent to a 
rotation of the 4-coordinates, it creates no difference between true tensors and 
pseudotensors of any rank. 

Let us now consider products of pairs of bilinear forms constructed from four 
different functions w*, w’, Ww, w*. The result depends on which pairs of functions 
are multiplied together. It is possible, however, to reduce any such product to 
products of bilinear forms with specified pairs of factors (W. Pauli and M. Fierz, 
1936). We shall derive the relationship on which this reduction is based. 

If we take the set of four-rowed matrices 


1,7’, ¥", iy*y’, io™, (28.8) 


where 1 is the unit matrix, arrange these 16 (=1+1+4+4+6) matrices in any 
definite order and denote them by y“ (A=1,..., 16), and also denote the same 
matrices with lowered 4-tensor indices (u,v) by ya, then they will have the 
following properties: 


try*=0 (y4# 1), (28.9) 


yoy = |, itr y “yp ='6o. 


+ To avoid any misunderstanding, it should be mentioned that the transformations T and P also 
involve a change in the arguments of the functions; the right-hand (transformed) sides in (28.6)-(28.7) 
are respectively functions of x’ =(-t,r), x’ =(t,—r), when the left-hand sides are functions of 
x =(t,r). 
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The last of these shows that the matrices y“ are linearly independent. Since their 
number is equal to the number (4x4) of elements of a four-rowed matrix, the 
matrices y“ form a complete set in terms of which an arbitrary four-rowed matrix 
I’ may be expressed: 


[=a cx Sata; (28.10) 
A 
or, in explicit form, with matrix suffixes i, k = 1, 2, 3, 4, 


Dix = 4 2 Tim’ Vint Aike 


Assuming, in particular, that in the matrix I only the element Ij, 1s non-zero, we 
obtain the required relation (the ‘““completeness condition’’): 


0i10km = i > YaikY nl: (28.11) 
Multiplying both sides of this equation by Uwe Gwi, we have 
(iw NW?) = 4D (hiya? yey *W). (28.12) 


This is one equation of the type mentioned above, reducing the product of two 
scalar bilinear forms to products of forms involving other pairs of factors.f 
Other equations of the same type may be obtained from (28.12) by the changes 


Wioyyt wo yw’, 


using the expansion 
yey? = 2 cry®, cr = ate y*y? yp 


(see the Problem). 

Here we may also give for future reference, the relation for two-rowed matrices 
which corresponds to (28.11). The complete set of linearly independent two-rowed 
matrices o“ (A = 1, 2, 3, 4) is 


1, Ox, Fys Oe (28.13) 


These have the properties 


tra*=0 (o# 1), 
(28.14) 


1 
strata? = Sap. 


+ It should be mentioned, to avoid any misunderstanding, that we are referring here to forms 
constructed from w-functions. The sign of the transformation would be opposite for forms constructed 
from the anticommuting w-operators. 
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The completeness condition is 


bay gs = , > O20 y 


= +O up ° Wsy an 25853 (28.15) 
(a, B, y, 6 = 1, 2), or 
Dap? Fsy = 30 ay * sg cm 58 ayDsp- (28.16) 
PROBLEM 


Derive formulae analogous to (28.12) for products of pairs of bilinear forms, P, V, A, T. 
SOLUTION. We use the notation 


Js = (WW yoy"), Jp = (WW bey), | 
Jv = (yw yb yw), Ja = (Wb iy* yw yiny *), 
Jr = (bio W’ \(hio,ab"), 


and the same symbols with primes to denote the products with w” and ws interchanged. The method 
used above gives 


4Js= Js+ Jv+ Jr+ Ja+ Sp, 
4J¥=4Js5 — 2Jy + 234 — 4Jp, 
4J7= 6Js 27 + 6Jp, 
4J,= 435+ 2Jy — 2J4 — 4Jp, 
4Jp= Js— Jvt+ Jr- Jat Jp, 


the first of these equations being the same as (28.12). 


§29. The polarization density matrix 


The dependence on the coordinates of the wave function wW which describes 
free motion with momentum p (a plane wave) reduces to a common factor e'?'', 
and the amplitude u, acts as a spin wave function. In such a state (a pure state), the 
particle is completely polarized (see QM, §59). In the non-relativistic theory this 
means that the particle spin has a definite direction in space (more precisely, there 
exists a direction in which the spin component has the definite value +). In the 
relativistic theory, this description of a state in an arbitrary frame of reference is 
not possible, because, as already mentioned in $23, the spin vector is not con- 
served. The term ‘“‘pure state”’ signifies only that the spin has a definite direction in 
the rest frame of the particle. 

In a state of partial polarization, there is no definite amplitude, but only a 
polarization density matrix p;, (i, k = 1, 2, 3, 4 being bispinor indices). We define this 
matrix in such a way that in a pure state it reduces to products: 


Pik = Upill pk. (29. 1) 
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Accordingly the matrix p is normalized by the condition 
trp =2m; (29.2) 


cf. (23.4). 
In a pure state, the mean value of the spin is given by the quantity 


= a fip-y°S Up. (29.3) 
The corresponding expression for a state of partial polarization is 


_ | i 
s= 7 tr (py°) = ra tr (py’y). (29.4) 


The amplitudes u, and i, satisfy the algebraic equations (yp —m)u, = 0, 
i,(yp —m)=Q. The matrix (29.1) therefore satisfies the equations 


(yp —m)p = p(yp — m) = 0. (29.5) 


The density matrix must satisfy similar linear equations in the general case of a 
state which is mixed (with respect to the spin); cf. the analogous argument in QM, 
$14. 

If we consider a free particle in its rest frame, the non-relativistic theory is 
applicable. In that theory the state of partial polarization is completely defined by 
three parameters, the components of the mean spin vector s (see QM, §59). It is 
therefore clear that the same parameters will define the polarization state after any 
Lorentz transformation, i.e. for a moving particle. 

Let twice the mean spin vector in the rest frame be denoted by ¢; in a pure state 
\c| = 1, in a mixed state |C)<1. For a four-dimensional description of the polariza- 
tion state it is convenient to define a 4-vector a” which in the rest frame its the same 
as the three-dimensional vector ¢; since € is an axial vector, a” 1s a 4-pseudovector. 
This 4-vector is orthogonal to the 4-momentum in the rest frame (in which 
a* =(0,C), p” =(m, 0)); in any frame, we therefore have 


a"p, = 9. (29.6) 
In any frame, moreover, 
a,a" = —(, (29.7) 


The components of the 4-vector a” in a frame in which the particle is moving 
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with velocity v= p/e are found by a Lorentz transformation from the rest frame, 
and are 


q’ = pl Gp a=, a= matt (29.8) 


where the suffixes || and 1 denote the components of the vectors ¢ and a parallel and 
perpendicular to the direction of p.t These formulae may be expressed in vector form: 


a=c+ Pep gS PoP SE pap PP (29.9) 


Let us first consider the unpolarized state (¢ = 0). The density matrix in this case 
can contain as parameters only the 4-momentum p. The only form for such a 
matrix which satisfies the equations (29.5) is 


p =2(yp +m) (29.10) 


(I. E. Tamm, 1930; H. B. G. Casimir, 1933). The constant coefficient is chosen in 
accordance with the normalization condition (29.2). 

In the general case of partial polarization (€ #0), we seek the density matrix in 
the form 


I 
p= 7 (yp + m)p'(yp +m), (29.11) 


which necessarily satisfies the equations (29.5). When ¢ = 0, the auxiliary matrix p’ 
must become a unit matrix; since (yp + m)’ = 2m(yp + m), (29.11) is then the same 
as (29.10). The matrix p’ must also contain the 4-vector a linearly as a parameter, 
i.e. must be of the form 


p'=1-—Ay*(ya); (29.12) 


the second term includes the “scalar”? product of the pseudovector a and the 
‘‘matrix 4-pseudovector” y°y. To determine the coefficient A, we write the density 


+ As regards their transformation properties, the components of the mean spin vector § (like those of 
any angular momentum) are, in relativistic mechanics, the space components of an antisymmetric tensor 
S**. The 4-vector a’ is related to this tensor by the equations 


Ap __ 1 Apvp A 2 App 
S = Im € AvP ps a= om é SurPp- 


It must be emphasized that, in an arbitrary frame of reference, the spatial part a of the 4-vector a” is not 
the same as the vector 2s: we easily see that 


7 1 
25; = (aye — a"lp|) = fi, 


2 E E 
28, =—a.=—C.. 
m m 
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matrix in the rest frame: 


p=am(1+ y+ Ayy: OU + y%) 
= am(1+ y+ Ayy-O 


and calculate the mean spin by (29.4). Using the rules given in §22, we easily find that 
the only non-zero term in the trace is 


Stee 5 
2s= 5 tr (py 'y) 


= —4Atr[(y- Oy] 
= At. 


Equating this to €¢, we have A=1. The final expression for p is obtained by 
substituting (29.12) in (29.11) and interchanging the factors p' and yp +m; since a 
and p are orthogonal, the product yp anticommutes with ya: 


(ya)(yp) = 2ap — (yp )(ya) = — (yp )(ya), 


and therefore commutes with y°(ya). 
Thus the density matrix of a partially polarized electron is given by the 
expression 


p =xyp + m)(1 - y°(ya)) (29.13) 


(L. Michel and A. S. Wightman, 1955). If the matrix p is known, the 4-vector a 
which describes the state can be found from 


at = "tr (py'7"), (29.14) 


and the vector ¢ is therefore also known. 

The formulae for the density matrix of the positron are entirely analogous to 
those of the electron. If the positron (with 4-momentum p) were described by a 
positron amplitude u®°? and by a density matrix p'° defined on the basis of this 
amplitude, then there would be no difference from the case of the electron, and the 
matrix p‘°°? would be given by the same formula (29.13). But, in actual calculations 
of cross-sections of scattering processes involving positrons, it is necessary (as we 
shall see below) to deal not with u'?°? but with the “negative frequency” amplitudes 
u_,. Accordingly, the polarization density matrix (which we denote by p“”’) must be 
defined so as to reduce to u_,;u#_,, for a pure state. 

According to (26.1), the positron amplitude u‘°? = Ucit_,. Conversely, 


u_, = Ucii?, i_,» = Ucu?? = uv? US; 


QE4 -4 
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cf. (28.3). If 


Pik? = U-pittpry pk = up aR”, 


then these formulae give 
p> = Ucp®™ UF. (29.15) 


Substituting for p®°? the expression (29.13), we obtain, after some simple 
rearrangements using (26.3) and (26.21), 


p = (yp — m)(1- y(ya)) (29.16) 


In particular, for an unpolarized state 


po =yp — m). (29.17) 


In referring to positron density matrices below, we shall intend the matrices p“, 
and the index (—) will be omitted; the matrices p°? will not be needed in practice. 

In various calculations it will often be necessary to average over spin states an 
expression such as uFu (=i,F,u,) where F is a (four-rowed) matrix and wu is the 
bispinor amplitude of a state having a definite 4-momentum p. This averaging is 
equivalent to replacing the products u,u; by the density matrix p,; of a partially 
polarized state. 

In particular, complete averaging over two independent spin states is equivalent 
to changing to an unpolarized state, and, by (29.10), we have 


5 > apFu, =3tr (yp + m)F. (29.18) 
polar. 
Similarly, for wave functions with negative frequency, 


+>) Gi_,Fu_, =4tr (yp — m)F. (29.19) 


polar. 


If summation over spin states replaces averaging, the result is doubled. 

Let us see how the density matrix (29.13) tends to its non-relativistic limit. To 
do so, we use the rest frame of the electron. In the standard representation of the 
wave functions, the amplitudes u, in this frame have two components in the limit, 
and the density matrix must accordingly have two rows. For, in the rest frame, 


=im(y°+ I(l—-y*y- 9), 


and we find from the expressions (21.20) and (22.18) for the matrices y 


re a rons = ml +o - 0), ( 29.20) 
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the zeros denoting two-rowed null matrices. If we use the normalization of the 
density matrix to unity (tr pnons = 1), aS is customary in the non-relativistic theory, 
instead of the normalization to 2m, then the above expression must be divided by 
2m, giving 


(1 +o -) 


in agreement with QM, (59.4), (59.5). 
Similarly, the non-relativistic limit for the positron density matrix is 


_/0 0 | . 
_ (5 Pnon-r ) ao mn oe 0). 


Finally, there is a simpler expression for the density matrix in the ultra- 
relativistic case. Putting in (29.8) |p| ~ « (and thereby neglecting small quantities of 


order (m/e)’), substituting these expressions in (29.13) or (29.16), and taking the 
direction of p as the x-axis, we can write 


p =e ye [1-¥(Z 0% yG+e.-y.)], 


Where the upper sign refers to the electron and the lower sign to the positron. 
When the product is expanded, the leading terms cancel, and those of the next 
order give 


p =2e(y— y I - v4 Gt yw) 
or, again writing e(y°— y’) in the form yp, 
p=2UypP)IL- y(t G4+G-yI- (29.21) 


This is the required expression for the density matrix in the ultra-relativistic 
case. It should be noticed that all the components of the polarization vector ¢ 
appear in this expression equivalently, as terms of the same order of magnitude. It 
will be recalled that {) is the component of this vector parallel Gf ¢)>0) or 
antiparallel (if ¢;< 0) to the particle momentum. In particular, for a helicity state of 
the particle, ¢;=2A = +1; the density matrix then has an especially simple form, 


p =2 yp(1+2ay’), (29.22) 


which is, as it should be, the same as the neutrino or antineutrino density matrix, 
these being particles with zero mass and definite helicity (see $30). 


§30. Neutrinos 
We have seen in §20 that the necessity of two spinors (€ and 7) to describe a 


particle with spin 3 is due to the mass of the particle. This necessity disappears if the 
mass is zero. The wave equation which describes such a particle can be derived froma 
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single spinor, say the dotted spinor 7: 
png =0, (30.1) 
or, equivalently, 
(pot+p:a)n =0. (30.2) 


It has also been noted in §20 that the wave equation containing the mass m is 
necessarily symmetrical with respect to inversion (the transformation (20.4)). When 
the particle is described by a single spinor, this symmetry Is lost, but it is not needed, 
since symmetry with respect to inversion is not a universal property of Nature. 

The energy and momentum of a particle with m = 0 are related by ¢ = |p|. Fora 
plane wave (n, ~ e *), equation (30.2) therefore gives 


(n+ O)Np =~: (30.3) 


where n is a unit vector in the direction of p. A similar equation, 
(n-o)n_-p = — 1-p, (30.4) 


applies to a wave with ‘“‘negative frequency” (n_, ~ e’”*). 
The second-quantized w-operator is 


(30.5) 


Hence it follows, as usual, that »*, are the wave functions of the antiparticle. 
The definition (20.1) of the operators p** shows that p°?* = — p°®. The complex 
conjugate spinor n* therefore satisfies the equations pins = 0, or, equivalently, 


Bapn?* = (0). 


We write nP* = €° thus expressing the fact that complex conjugation changes the 
dotted to the undotted spinor. The wave function of the antiparticle then satisfies the 
equation 


Dapé” = 0, (30.6) 
Or 


(Po p> ag =0. (30.7) 


Hence, for a plane wave, 


(n> o)& = &. (30.8) 
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But 3n-o is the operator which projects the spin on the direction of motion. 
Equations (30.3) and (30.8) therefore signify that states of the particle with a definite 
momentum are necessarily helicity states, for which the spin component in the 
direction of motion has a definite value. If the particle spin is opposite to the 
momentum (helicity —3), then the antiparticle spin is along the momentum (helicity 
+5), 

The neutrinos which occur in Nature appear to be particles possessing these 
properties. The particle with helicity —3is conventionally called the neutrino, and that 
with helicity +3 the antineutrino.t 

In connection with the fact that the neutrino states are not degenerate with respect 
to spin directions, reference should be made to the comment in 88 that a massless 
particle has only axial symmetry about the direction of the momentum. For a strictly 
neutral particle (the photon), this symmetry includes both rotations about the axis and 
reflections in planes passing through the axis. For the neutrino, there is no reflection 
symmetry, leaving only the group of rotations about the axis, which conserve the 
angular momentum component along the axis and do not change its sign. The 
symmetry with respect to reflections exists only if the particle is at the same time 
replaced by the antiparticle. 

It should also be noted that the necessarily longitudinal polarization signifies that 
the spin of the neutrino cannot be distinguished from its orbital momentum, just as 
with the necessarily transverse photon fields (86). 

From one spinor 7 (or €), only four bilinear combinations can be constructed, 
which together form the 4-vector 


j* =()*n, n* on). (30.9) 
It is easily verified that the equations 
(pot p:a)yn = 0, n*(po- p> oa) =0 


imply the continuity equation 0,j" = 0, so that j* acts as a particle current density 
4-vector. 

It is convenient to normalize neutrino plane waves in a manner similar to that used 
in §23 for particles possessing mass: 


a2 I —ipx a 1 ipx 
"Np — V(2e) Upe ; Np — (2e) U-pe (30.10) 
the spinor amplitudes being normalized by the invariant condition 
u*,(1, o)u., = 2(e, p). (30.11) 


The particle density and particle current density are then j°= 1, j=p/e =n. 
Since a free neutrino with a given momentum is always completely polarized, there 


+ The existence of neutrinos was theoretically predicted by W. Pauli (1931) in order to explain the 
properties of 8-decay. Equation (30.1) was first discussed by H. Weyl (1929). The neutrino theory based on 
these equations was evolved by L. D. Landau, T. D. Lee and C. N. Yang, and A. Salam (1957). 
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are no states which are mixed (with respect to the spin). It may nevertheless be 
convienent to define a two-rowed polarization “‘density matrix”’ simply as the spinor 
of rank two 


Dig = Uap, (30.12) 


for which tr p = 2e. An expression for this matrix can be obtained by noting that it 
must satisfy the equations 


(e+p:o)p=plet+p:a)=0. 
Hence we have 
p=eE-p-a. (30.13) 


In the consideration of various interaction processes, neutrinos may appear 
together with other particles having spin 3 and possessing mass, which are therefore 
described by four-component wave functions. In such cases it is convenient to retain 
uniformity of notation by formally defining for the neutrino also a “‘bispinor’’ wave 
function having two components zero: 


But this form of w is in general not preserved in another (non-spinor) representation. 
This difficulty can be overcome by noting that in the spinor representation we have 
identically 


HZ Q)=0) rer oroy 


where & is an arbitrary ““makeweight” spinor which does not appear in the final result; 
the matrix y° is given by (22.18). The condition for a true “two-component” neutrino 
will therefore be maintained when it is described by a four-component w in any 
representation, if & is taken to be the solution of Dirac’s equation with m = 0: 


(yp)e = 90, (30.14) 
subject to the additional condition 3(1 + y°)W = WJ, or 
yw = y, (30.15) 


This condition may be taken into account by replacing Ww and wW, wherever they 
would occur, by the following expressions: 


5 


yoy FI (30.16) 
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For example, using these expressions in wy", the current density 4-vector may be 
written in the form 


j*=wd-y)y"+ ye 
= syry"(1 + yeh. (30.17) 


In the same way, the four-rowed neutrino density matrix becomes 


p =4(1t+ y yp) — y°) = 20. + y°)(yp). (30.18) 


In the spinor representation it reduces, as it should, to the two-rowed matrix (30.13), 


=i 0 | 
p e-o:p O/ 


The corresponding formulae for the antineutrino differ from those given above by 
a change of the sign of y”°. 

The neutrino is an electrically neutral particle, but, with the properties described 
above, itis not a strictly neutral particle. Here it must be noted that the “neutrino field”’ 
described by a two-component spinor is equivalent, as regards the number of possible 
particle states (but not, of course, as regards its other physical properties) to a strictly 
neutral field described by a four-component bispinor. Instead of states of particles and 
antiparticles with definite helicity, we should here have the same number of states of 
one particle with two possible values of the helicity, and the property of symmetry 
with respect to inversion would automatically occur. However, the zero mass of the 
“four-component” neutrino would be, so to speak, “‘accidental’’, since it would be 
unrelated to the symmetry properties of the wave equation describing the neutrino 
(which allows also a non-zero mass). The various interactions of such a particle would 
therefore necessarily imply the existence of a small but not zero rest mass. 


§31. The wave equation for a particle with spin 3/2 


A particle with spin 3/2 is described, in its rest frame, by a three-dimensional 
symmetrical spinor of rank three (having 2s5+1=4 independent components). 
Correspondingly, in an arbitrary frame of reference, its description may involve the 
4-spinors €“’, nag, and 6°, y«g,, each of which is symmetrical in all the indices of one 
kind (dotted or undotted). The spinors in each pair are interchanged by inversion. 

In order that the 4-sponors €°°’ and nag, should become in the rest frame 3-spinors 
symmetrical in all three indices, they must satisfy the conditions 


BD nagy=0, Papé*? =0: (31.1) 


in the rest frame we have 


ps —> pods a més, 
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as is seen from (20.1), and the conditions (31.1) therefore imply the equations 
6 ony = 0; BoE by = 0, 


where the primed letters denote the corresponding three-dimensional spinors. Thus 
these spinors give zero on contraction with respect to the indices a, B, which means 
that they are symmetrical in these two indices, and hence in all three. 
The differential relations between the spinors é and y are 
p°’nés i més”, 
tees 31.2 
BsyEE" = mnbs. Cee) 
The conditions (31.1) ensure that the left-hand sides of (31.2) vanish on con- 
traction with respect to any other pair of indices, and hence that these quantities are 
symmetrical in B, y or a, 6. In the rest frame, the three-dimensional spinors é’ and 7’ 
are the same according to (31.2). On eliminating 7 or é€ from (31.2), we find that each 
component of the spinors € and 7 satisfies the second-order equation 


(p> — m)E*) = 0. (31.3) 


The equations (31.1), (31.2) form a complete set of wave equations for a particle 
with spin 3/2. No further results would be obtained by using the spinors ¢ and y. Their 
structure is as follows: 


moor a png’, 
ba: eile cee a 
MX apy = Pas§ By: 


The equations of particles with spin 3/2 can also be put in a different form, using the 
vectorial properties of spinors (W. Rarita and J. Schwinger, 1941; A. S. Davydov and I. 
E. Tamm, 1942). One four-dimensional vector index wp is assigned to a pair of spinor 
indices aB. Thus the components é7°’ of the spinor of rank three can be put into 
correspondence with the components of the “mixed”? quantities #7, which have one 
vector and one spinor index. Similarly, the spinor 7°*’ is correlated with yw”, and the 
two spinors together correspond to a “‘vector’’ bispinor w, (omitting the bispinor 
index). The wave equation then becomes a ‘“‘Dirac equation” for each of the vector 


components y,: 
(yp —m)p, = 9, (31.4) 
with the added condition 
y"Wy = 9. (31.5) 
Using the expressions for the matrices y” in the spinor representation and the 


+ See the paper by Fierz and Pauli, cited in §15, concerning the Lagrangian formulation of these 
equations. 
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relations (18.6), (18.7) between the spinor and vector components, we can easily verify 
that equation (31.4) implies equations (31.2), and that the condition (31.5) is equivalent 
to the condition for the SEOFs €°°7 and n “©” to be symmetrical in the indices By or By. 
Multiplying (31.4) by y* and using (31.5), we find 


y"y"pih, = 9, 
or, with the commutation rules for the matrices y", 
28" Dh — y'Doy"h = 0. (31.6) 
The second term in turn is zero by (31.5), and the first term gives 
p’w, = 0. (31.7) 


This condition, which is implied by (31.4), (31.5), is easily seen to be equivalent to the 
conditions (31.1). 

Finally, yet another way of expressing the wave equation Is to use quantities Wi (i, 
k, |!=1, 2, 3, 4) with three bispinor indices, in which they are symmetrical (V. 
Bargmann and E. P. Wigner, 1948). The set of these quantities is equivalent to the 
components of all four spinors é, n, ¢, y. The wave equation becomes a set of ‘““Dirac 
equations” 


Duy imYma = Mipfin (31.8) 


We easily see that these equations yield the necessary number (four) of independent 
components Wj, and there is therefore no need to impose any further conditions. In the 
rest frame, (31.8) becomes 


Yim Wink = Wikls 


according to which all the components with i, k, | =3, 4 are zero (in the standard 
representation), i.e. the Wj, reduce to the components of a three-dimensional spinor of 
rank three. 

The results given above have an obvious generalization for particles with any 
half-integral spin s. In the description by equations of the form (31.4), (31.5), the wave 
function is a symmetrical 4-tensor of rank 3(2s — 1) with one bispinor index. In the 
description by equations of the form (31.8), the wave function has 2s bispinor indices 
and is symmetrical in these. 


CHAPTER IV 


PARTICLES IN AN EXTERNAL FIELD 


§32. Dirac’s equation for an electron in an external field 


THE wave equations of free particles express essentially only those properties 
which depend on the general requirements of space-time symmetry. Physical 
processes involving the particles, however, depend on their interaction properties. 

The description of the electromagnetic interactions of particles in relativistic 
quantum theory can be effected by a generalization of the method used in classical 
non-relativistic quantum theory. 

This method, however, is applicable to the description of electromagnetic 
interactions only of particles that are not capable of strong interactions. These 
include electrons (and positrons), and the very wide domain of electron quantum 
electrodynamics is therefore accessible to the existing theory. There are also 
unstable particles, the muons, which are not capable of strong interactions; they 
are described by the same quantum electrodynamics as regards phenomena occur- 
ring in times short in comparison with their lifetime (with respect to weak 
interactions). 

In this chapter we shall discuss problems of quantum electrodynamics which 
fall within the scope of single-particle theory. These are problems in which the 
number of particles is unchanged, and the interaction can be represented in terms 
of an external electromagnetic field. Besides the conditions which ensure that the 
external field may be regarded as given, there are conditions arising from “radiative 
corrections” which also impose limits on the validity of such a theory. 

The wave equation of an electron in a given external field can be derived in the 
same way as in the non-relativistic theory (QM, §111). Let A* =(®, A) be the 
4-potential of the external electromagnetic field (A being the vector potential and ® 
the scalar potential). We obtain the desired equation on replacing the 4-momentum 
operator p in Dirac’s equation by p — eA, where e is the charge on the particle f: 


[y(p — eA) — m]y = 0. (32.1) 


The corresponding Hamiltonian is found by making the same change in (21.13): 


A 


H =a: (p—eA)+ Bm+ e®. (32.2) 
The invariance of Dirac’s equation under gauge transformations of the elec- 
tromagnetic field potential is shown by the fact that it is unchanged in form if the 


transformation A— A+ ipy (where x is an arbitrary function) is accompanied by 


+ The charge together with its sign is meant, so that for the electron e = —|e|. 
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the following transformation of the wave function:t 
b> we™; (32.3) 


cf. the corresponding transformation for Schrédinger’s equation in QM, 8111. 
The current density is expressed in terms of the wave function by the same 
formula (21.11), 


j= byy, 


as when there is no external field. It is easily seen that, when the calculations used 
in deriving (21.11) are repeated with the equation (32.1) (and the equation (32.4) 
below), the external field disappears, and the continuity equation contains the same 
expression for the current as previously. 

Let us apply the operation of charge conjugation to equation (32.1). To do so, 
we write the equation 


bl y(p + eA) +m] =0, (32.4) 


which is obtained as the complex conjugate of (32.1), in the same way as equation 
(21.9) was derived in Chapter HI, and using the fact that the 4-vector A is real. 
Putting this equation in the form 


[7(p + eA) + m]b =0, 
multiplying it on the left by the matrix Uc and using the relations (26.3), we find 
[y(p + eA) — m](Cy) = 0. (32.5) 


Thus the charge-conjugate wave function satisfies an equation which differs 
from the original equation by a change in the sign of the charge. The operation of 
charge conjugation, however, corresponds to a change from particles to antiparti- 
cles. We see that, if the particles possess an electric charge, the signs of the 
electron and positron charges are necessarily opposite. 

The first-order equations (32.1) can be transformed into second-order equations 
by applying to them the operator y(p — eA)+™m: 


[y“y"(B, — eA, )(B, — eA,) — m*]yp = 0. 
The product y"y” may be written as follows: 
yey” =ayty’ + yy") +alyty” — y’y") 
a ili ae 


+ The transformation (32.3) with a function y(t, r) is sometimes called a local gauge transformation, 
in contrast to the global gauge transformation (12.10) with a constant phase a. 
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where o”” is the antisymmetric “‘matrix 4-tensor” (28.2). On multiplying by 0” we 
can antisymmetrize by the substitution 


(Du -— eA, (Dp, _ eA,) “Ta, X(B,, — eA, Mp, _ eA,)}- 
= 7e(—A,p, + pA, — B.A, + A,p,) 
= 4ie(0,A, — 0,A,) 


= —tieF,, 


(with F,, = 0,A,—0,A, the electromagnetic field tensor). The result is the second- 
order equation 


[((p — eA) — m’—5ieF,,0"” |p = 0. (32.6) 


The product F,,0"” may be written in three-dimensional form in terms of the 
components 


o”’=(a,id), FF" =(-E,H). 
Then 
((p — eA’ — m’*+ e%-H—ien- Ely =0, (32.7) 


or, in ordinary units, 


. 2 2 
(e2-£@) -(inv+£a) —mic?+ 23 -H-i Sa Ely=0. (32.7a) 
Cc ot c C C C 


The occurrence in these equations of terms in the fields E and H is due to the spin 
of the particle, and will be further discussed in §33. 

The solutions of the second-order equation include, of course, “redundant” 
solutions which do not satisfy the original first-order equation (32.1), being solu- 
tions of that equation with the opposite sign of m. The choice of the appropriate 
solutions in particular cases is usually obvious and causes no difficulty. The 
customary procedure is that, if @ is any solution of the second-order equation, then 
a solution of the correct first-order equation is 


w =[y(p — eA) + m]¢. (32.8) 


For, on multiplying this equation by y(p — eA)—m, we see that the right-hand side 
vanishes if @ satisfies (32.6). 

It should be emphasized that the introduction of the external field into the 
relativistic wave equation by replacing p by p — eA is not self-evident. In doing so, 
we have essentially made use of a further principle: this substitution must be 
applied to first-order equations. For this reason equation (32.6) contains additional 
terms which would not appear if the substitution were made directly in the 
second-order equation. 
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The stationary-state solutions of Dirac’s equation in an external field muy 
include states of both the continuous spectrum and the discrete spectrum. As in the 
non-relativistic theory, states of the continuous spectrum correspond to infinite 
motion, in which the particle can be at infinity; it may there be regarded as a free 
particle. Since the eigenvalues of the Hamiltonian of a free particle are +V(p + 
m’), it is clear that the continuous spectrum of energy eigenvalues is in the ranges 
e=m and ¢-x-—m. If -m<e<m, the particle cannot be at infinity, and the 
motion is therefore finite and the state belongs to the discrete spectrum. 

As in the case of free particles, the wave functions with “‘positive frequency” 
(c >0) and with “negative frequency” (« <0) appear in a definite manner in the 
second quantization procedure. For particles in an external field, there is a natural 
generalization of this procedure, the plane waves in formulae (25.1) being replaced 
by appropriately normalized eigenfunctions w% and ws of Dirac’s equation, 
corresponding to positive and negative frequencies (¢ and —«{”’): 


b= DV Lah P em! $ Hr YO |i, 
nt 
(32.9) 
WHS {GFP cl BPO emOF, 
nt 


However, as the potential well becomes deeper, the energy levels may cross the 
boundary « =0, so that positive levels become negative (or vice versa when the 
potential has the opposite sign). Nevertheless, for the sake of continuity we must 
still regard these as electron (not positron) levels. That is, the electron states are to 
be regarded as all those which approach the positive limit of the continuous 
spectrum (e = m) when the field is removed with infinite slowness. 

Although Dirac’s equation for an electron in an external field does, as already 
mentioned, yield solutions for a large class of problems in quantum elec- 
trodynamics, we must at the same time emphasize that the applicability of the 
concept of an external field in the one-particle problem of relativistic theory is 
nevertheless restricted, because of the spontaneous formation of electron—positron 
pairs in sufficiently strong fields (see §§35 and 36). 

We shall not deal in this book with the inclusion of an external field in the wave 
equations for particles with spin other than 3, since the topic has no immediate 
physical significance: actual particles having such spins are hadrons, and their 
electromagnetic interactions cannot be described by wave equations. 


PROBLEM 


Determine the electron energy levels in a constant magnetic field. 

SOLUTION. The vector potential is A, = A, =0, Ay = Hx (the field H being along the z-axis). The 
components py, pz of the generalized momentum (as well as the energy) are conserved. 

We use the second-order equation for the auxiliary function @ (see (32.8)), and assume that ¢ is an 
eigenfunction of the operator %, (with eigenvalue o = +1) and of the operators py, p.. The equation for @ 
is 


2 
{- sn + (eHx — py)’ - Ho} =(e°—-m’- p2)o. 
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This equation is the same in form as Schr6dinger’s equation for a linear oscillator. The eigenvalues of ¢ 
are given by 


e°—m’—p,=|e|H(2n+1)—eHo, n=0,1,2,...; 


cf. QM, §112. The wave function w, which is to be determined from @ according to (32.8), is not an 
eigenfunction of the operator %., in accordance with the fact that the spin is not conserved for a particle 
in motion. 


$33. Expansion in powers of 1/ct 


We have seen in §21 that, in the non-relativistic limit (v > 0), 
two components (y) of the bispinor & = ta vanish. 


Hence xy <@ when the electron velocity is small. This leads to an approximate 
equation involving only the two-component quantity ¢, obtained by a formal 
expansion of the wave function in powers of I/c. 

Dirac’s equation for an electron in an external field may be written 


ine = {ca (p-£A)+pme?+eoly, (33.1) 


The relativistic energy of the particle includes also its rest energy mc’. This must 
be excluded in arriving at the non-relativistic approximation, and we therefore 
replace w by a function w' defined as follows: 


wy = ' ev imettlh 


Then 
e O 2 , A e 2 
(it val mc uy = {ca (p-<A)+ me + eb ty! 


Substituting wb’ = ta) we obtain the equations 


+. O ae fae ; 
(ih ——e\¢ ee (6 eA)x', (33.2) 
(it = ed + 2me?)x’ ae (p-£a)e (33.3) 


The primes to @ and y will henceforth be omitted; this will cause no misunder- 
standing, since only the transformed function ws’ is used in the present section. 


+ In this section, ordinary units are used. 
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In the first approximation, only the term 2mc’y is retained on the left-hand side 
of (33.3), which gives 


(thus y ~ ¢/c). Substitution of this in (33.2) gives 


[ng -ea)o—z(o-(0-£4))' 


For the Pauli matrices we have the relation 
(a9: al(o-b)=a'bt+ia:axXb, (33.5) 


where a and b are arbitrary vectors (see (20.9)). In the present case, a=b= 
p— eA/c, but the vector product a Xb is not zero, since p and A do not commute: 


| (6-< a) x (s-£a)]o =i Ftaxv+vx a}g 
=i oua-¢. 


Thus 


(o-(p-£)) =(6-£a) -*o-n, (33.6) 


where H = curl A is the magnetic field, and for @ we obtain the equation 


9b 7, fl (2. e,\ eh 
ih =H =|5,(b-£ a) tebd-s-o Ho, (33.7) 


This is Pauli’s equation. It differs from the non-relativistic Schrodinger’s 
equation by the last term in the Hamiltonian, which has the form of the potential 
energy of a magnetic dipole in the external field (cf. QM, $111). Thus, in the first 
approximation (with respect to 1/c), the electron behaves as a particle having both 
a charge and a magnetic moment 


é 
=~ hs. (33.8) 


The gyromagnetic ratio e/mc is twice its value for a magnetic moment due to 
orbital motion.t 


+ This remarkable result was first derived by P. A. M. Dirac in 1928. The two-component wave 
function satisfying equation (33.7) was introduced by W. Pauli (1927), before Dirac’s discovery of his 
equation. 


124 Particles in an External Field §33 


The density p = w* = d*¢ + y*x. The second term must be omitted in the first 
approximation, so that p =|¢|’, as would be expected for Schrédinger’s equation. 
The current density is 


j= cy*ay 
= c(¢*oy + x*o¢). 


We substitute here, from (33.4), 


ei (—inv-£ ag, 


| 
~~) 


1 e 
* ay 2 # 
R= ae (inv ; A)¢ oC 
and transform the products containing two factors o by means of (33.5) in the form 
(o -ajo=a+io Xa, o(o:a)=atiaXxa. (33.9) 


The result is 


i= FA(Vd*— 6*V4)-— Ado += curl (H*0d), (33.10) 


in agreement with the expression in the non-relativistic theory, QM (115.4). 

Let us now derivet the second approximation, continuing the expansion as far 
as terms of order 1/c’, and assuming that there is only an electric external field 
(A = 0). 

First, we note that, when terms ~1/c’ are included, the density is 


=|¢P+|x/?= lol +3 * jirale: Vo)’. 


This differs from the Schré6dinger expression. In order to find (in the second 
approximation) the wave equation corresponding to Schrédinger’s equation, we 
must replace ¢ by another (two-component) function d¢s.4, for which the time- 
independent integral would be of the form f |ds.|’ d*x, as it should be for 
Schrodinger’s equation. 

To obtain the required transformation, we write the condition 


[ b8bsaa°x = | [o*o +2" Zina (Vs*- oo -Vs)} as 


+ The method used here is due to V. B. Berestetskii and L. D. Landau (1949). 
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and integrate by parts: 
| (W4*- 0N(o- V4) dx =— | o*(o- Wie Vo ax 
-| b*Ad d*x 
(or the same with @ and #* interchanged). Thus 
[ d8dsad’x = [ (os - Be a(62de + 440} ae, 


whence it is evident that 


bsa= (1 +gha)e. 6=(1 ~ goes) sen (33.11) 


8m?c? 8m 


To simplify the notation we shall assume a stationary state, replacing the 
operator ih d/dt by the energy e« (with the rest energy subtracted). In the next 
approximation after (33.4) we have from (33.3) 


x=5c(1- > ame eT \(o- p)¢. 


This is to be substituted in (33.2) and @ then replaced by ¢s., according to (33.11), 
omitting all terms of higher order than | ic’. A simple calculation leads to an 
equation for ¢s,, in the form edscn = Hos-h, where the Hamiltonian is 


A a 4 
e r me A 
Ps op — P+ TS Mle +p) Oo - p)—1P°® + 0B) 


The expression in the braces is transformed by means of the formulae 


(a: p)P(o : p) = Op’ + (a: p®)(a- p) 
= Op’ + ih(o : E)\(o: p), 
pd — Op’ = — h’?A® + 2ihE - Dp, 


where E = —V® is the electric field. The final expression for the Hamiltonian 1s 


A2 a4 
-  p __ jp eh eh’ 
H Sia EP Smee Anie gna ee Ex<p- Sim 2c ——75 div E. (33.12) 


The last three terms are the required corrections of order 1/c*. The first of these 
three terms is due to the relativistic dependence of the kinetic energy on the 
momentum (the expansion of the difference cV(p?+m’c”)— mc’). The second, 
which may be called the spin-orbit interaction energy, is the energy of the 


QE4- J 
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interaction of the moving magnetic moment with the electric field.+ The last is zero 
except at points where there are charges creating the external field (for instance, in 
the Coulomb field of a point charge Ze, A® = — 4mZed(r)) (C. G. Darwin, 1928). 

If the electric field is centrally symmetric, then 


and the spin-orbit interaction operator can be put in the form 


eh . aD he dU; 
dmc **P Gr amer dr” oH) 
where 1 is the orbital angular momentum operator, § = So is the electron spin 
operator, and U = e@ is the potential energy of the electron in the field. 


§34. Fine structure of levels of the hydrogen atom 


Let us determine the relativistic corrections to the energy levels of the 
hydrogen atom—an electron in the Coulomb field of a fixed nucleus.+ The velocity 
of the electron in the hydrogen atom is v/c ~ a <1. The required corrections can 
therefore be calculated by the use of perturbation theory, averaging over the 
unperturbed state (i.e. over the non-relativistic wave function) the relativistic terms 
in the approximate Hamiltonian (33.12). For somewhat greater generality we shall 
take the charge of the nucleus as Ze, assuming, however, that Za <1. 

The field of the nucleus is E = Zer/r’, and its potential satisfies the equation 
A® = — 47Ze8d(r). Substituting this in the last three terms in (33.12) and using the 
fact that the electron charge is negative, we obtain the perturbation operator 


; s§ Za. ZZ 
Va—p tar al 8+ 5 3 6tn). (34.1) 


Since, according to the non-relativistic Schrédinger’s equation, 
pw =2m (eo ale ay 
(where €9 = —mZ’a?/2n’ is the unperturbed level and n the principal quantum 


+ With the magnetic moment (33.8) and the velocity v= p/m, this energy becomes —p- EX v/2c. At 
first sight this result may appear unlikely, because on changing to a frame of reference fixed to the 
particle there arises a magnetic field H = E X v/c, in which the energy of the magnetic moment should be 
—w-°H. The occurrence of the factor 5 (the ““Thomas half’’; L. H. Thomas, 1926) is due to the general 
requirements of relativistic invariance together with the particular properties of the electron as a 
“spinor” particle with the corresponding value of the gyromagnetic ratio (see §41). 

+ The effect of the motion of the nucleus on these corrections is a quantity of a higher order of 
smallness, which will not be considered here. 
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number), the mean value is 
2 
p = 4m? (eo + =) ; 


This quantity, like the mean value of the second term in (34.1), is calculated by 
means of the formulae (see QM, $36) 


r= maZ/n’, 
r?=(maZy/n(1 +4), (34.2) 
r?>=(maZy/n3l(1+>(1 + 2), 
the last of which 1s valid if 14 0; the eigenvalue is 
I-s=4fG+)-ll+1)—-j] if 10, 


I-s=0 if | =0. 


Finally, the third term is averaged by means of the formulae 


3/2 
(34.3) 
Ws (0) = 0, lA 0. 


The result of a simple calculation using the above formulae may be written for 
all cases (for all J and |) as 


= - Zev ( 1 -7-) 
Ag as ae j+3 An ‘ (34.4) 


Formula (34.4) gives the required relativistic correction to the energy of the 
hydrogen levels; that is, it gives the fine-structure energy.f In the non-relativistic 
theory, there is both degeneracy with respect to spin direction and Coulomb 
degeneracy with respect to |. The fine structure (spin-orbit interaction) removes 
this degeneracy, but not completely: there remain levels with mutual double 
degeneracy, having the same n and j but different |=j+}. The levels with the 
maximum possible value for j for a given n, 


ieee = Imax +3 = n —4, 
are then not degenerate. Thus the sequence of hydrogen levels, with allowance for 


+ This formula, and the more exact formula (36.10), were first derived by A. Sommerfeld from the 
old Bohr theory before the development of quantum mechanics. 
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the fine structure, is 


Ls123 
2812, 2P 1/25 2 P32; 
381/25 3P 1125 33/25 33a, 3dspr; 


The level with principal quantum number n is split into n fine-structure com- 
ponents. 

In non-relativistic mechanics, the “‘accidental’”’ degeneracy of the energy levels 
in a Coulomb field is due to the existence of a conservation law peculiar to this 
field, relating to a quantity A whose operator is 


‘i r 1 Ri ctee wanes ie 
=-—+=—_ (Ix p-—pXD: 
A=—+57 7 xp—pxD; 
see QM, (36.30). This specific conservation law is also responsible for the twofold 
degeneracy which remains in the relativistic case: the Hamiltonian H = 
a: p+ Bm —e’/r of Dirac’s equation commutes with the operator 


p=t-y+— pey-i+ Dy(H — mp) 


(M. H. Johnson and B. A. Lippmann, 1950). In the non-relativistic limit, [ > % + A. 

We shall see later ($123) that this remaining degeneracy is removed by “‘radia- 
tive corrections” (the Lamb shift), which are neglected in Dirac’s equation for the 
single-electron problem. 

To anticipate, it may be mentioned here that the order of magnitude of these 
corrections is mZ‘a° log (1/a). The second-order spin-orbit interaction correction 
would be ~m(Za)*, and the ratio of this to the radiative corrections is therefore 
~Z’allog (1/a). For hydrogen (Z = 1), this ratio is certainly small, and the exact 
solution of Dirac’s equation is therefore of no significance in that case, but it may 
be significant as regards the electron energy levels in the field of a nucleus with 
large Z (§36). 


§35. Motion in a centrally symmetric field 


Let us now consider the motion of an electron in a centrally symmetric electric 
field. 

Since the angular momentum and the parity (relative to the centre of the field, 
which is taken as the origin) are conserved in a central field, the discussion in §24 
regarding spherical waves of free particles is entirely applicable to the angular 
dependence of the wave functions of such a motion; only the radial functions vary. 
Accordingly, we shall seek the wave function of the stationary states (in the 
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standard representation) in the form 


4 (= (ape lito) est 


where | =j +3, l'=2j—1, and the power of —1 is chosen for subsequent con- 


venience. 
Dirac’s equation in the standard representation yields the following equations 


for @ and y: 


fae —U —_ nn : 
(e-m—U)d=oa * aoe 


(e+m-—U)x =a: pd, 


where U(r) = e®(r) is the potential energy of the electron in the field. The result of 
substituting the expressions (35.1) is calculated by evaluating the right-hand sides 


of these equations. 
Expressing the spherical harmonic spinor Qj in terms of Ojim by 


r 
(see (24.8)), we can write 
ee ; g 
(o- p)y =—Uo- p)(o-r) ; Odin: 


Now transforming the product (o: p)(o:r) by means of the formula (33.5), and 
expanding the vector operators, we have 


(6+ B)y = —i{p-r tio pXr} = On 
={-divr—(r-V)—o +r xp} 2 Om 
? . 
=-le'+fetho. I} Om, 


where 1=rxX p is the orbital angular momentum operator; the prime denotes 
differentiation with respect to r. The eigenvalues of the product o@ -1=2I1-s are 


21-s=jy-Pr-s 


=jj+D-10+ 1-3 
1 
2 


In order to be able to use the same formulae for both cases (1 =j +3), it is 
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convenient to write 


kK = —-(j+3)=—-(1 41) for j=1+3, 
(35.3) 
2 


=+(j+)=1 for j=1~3. 
The number « takes all integral values except zero, the positive numbers cor- 


responding to the case j = | —3, and the negative numbers to the case j = 1 +4. Then 
I-o =—(1+k), and 


‘ / I- 
(o-p)x =- (2 + 2) 0m 


When this expression is substituted in the first equation (35.2), the spherical 
harmonic spinor Qjm cancels from the two sides. Proceeding similarly with the 
second equation, we finally obtain the following equations for the radial functions: 


f+ ttf —(6+m—U)g =0, 
(35.4) 
g' +o + (e-m-U)f =0. 
Or 
(fry += (fr) ~(e +m — U)gr = 0, 
(35.5) 


(gr)' — = (er) +(e —m —U)fr =0. 


Let us examine the behaviour of f and g at small distances, assuming that the 
field U(r) increases more rapidly than 1/r as r>0. Then, for small r, equations 
(35.4) become 


f'+Ug=0,  g'—Uf =0. 


These have real solutions, of the form 


f = constant X sin([ U dr+ 5), 
(35.6) 
g = constant X cos( | U dr + a), 


where 6 is an arbitrary constant. These functions oscillate as r-0, but tend to no 
limit. It is easy to see that this situation corresponds, in the non-relativistic theory, 
to the “‘fall’’ of a particle to the centre. 

First of all, we note that the smallness of r here places no restrictions on the 
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choice of solution, since there is no condition at r = 0 for the oscillatory function, 
and so the choice of 6 remains arbitrary (the correct behaviour of the wave 
function for large r can be ensured by an appropriate choice of 6, for any value of 
e). This indeterminacy can be eliminated by regarding the potential with a sin- 
gularity (at r=0) as the limit, when ro 0, of a potential cut off at some 7p, Le. 
equal to U(r) for r>rp and U(r) for r<ro. With ro finite, we of course obtain a 
definite set of energy levels, but the energy of the ground state tends to —© as 
ro 0. 

In the non-relativistic theory, this signifies a “‘fall”’ to the centre, since a particle 
at a deep level is localized in a small region near r = 0. In the relativistic theory, 
such a situation is impossible, since it would imply that the system was unstable 
with respect to the spontaneous generation of electron—positron pairs. For, whereas 
an energy exceeding 2m is needed to create such a pair in a vacuum, a smaller 
energy is sufficient in a field. In the presence of an electron bound state with energy 
€ <m, a pair can be formed by expending an energy ¢ + m < 2m, the result being a 
free positron and an electron in a bound state. If, on the other hand, the bound 
state energy is ¢ <—m, such a field can create positrons (with energy —e >m) 
spontaneously, without taking energy from an external source. In the field under 
consideration, as ro—>0, there is an infinity of such ‘‘anomalous” levels with 
e <—m. Thus fields whose potential ®(r) increases more rapidly than I/r as r>0 
cannot be dealt with by means of Dirac’s theory. This applies to potentials of either 
sign. Although a “fall”? can occur, of course, only with attraction, the sign of 
U =e@® depends also on that of the charge, so that electron levels behave 
anomalously in one case and positron levels in the other; in the latter case, the field 
generates free electrons. 

Let us next consider the behaviour of the wave functions at large distances. If 
the field U(r) decreases sufficiently rapidly as r->©, it may be entirely neglected in 
the equations when determining the asymptotic form of the wave functions at large 
distances. When e€ >m, 1e. in the continuous spectrum, we then return to the 
equation of free motion, so that the asymptotic form of the wave functions 
(spherical waves) differs from that for a free particle only by the presence of 
additional ‘“‘phase shifts’’, whose values are determined by the form of the field at 
short distances.t These shifts depend on the values of j and |; that is, on the 
number « defined above (and also, of course, on the energy <). Denoting the phase 
shifts by 6, and using the expresssion (24.7) for a free spherical wave, we can 
therefore immediately write down the required asymptotic formula: 


pr? 1 ( Ve + m)Qjm sin (pr — xl + 6,) (35.7) 
rv (2e)\-V(e — mM)Qjirm Sin (pr —5l'7 + 6,))” 
or, with the definition (35.1), 
‘ - Ae cei sin, 4 
of = cos (Pr 2h + 8x), (35.8) 


+ Cf. QM, §33. As in the non-relativistic theory, U(r) must decrease more rapidly than 1/r. The case 
U ~ 1/r will be discussed separately in $36. 
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where p = V(e’?—m?’). The common coefficient here corresponds to the nor- 
malization of the radial functions by (24.5). 

The wave functions of the discrete spectrum (e < m) decrease exponentially as 
ro: 


pa ft 9 = At expt-rVin?— o 85.9) 


where Ay is a constant. 

As in the non-relativistic theory, the phase shifts 6, (more precisely, the 
quantities e”« — 1) determine the scattering amplitudes in a given field, as will be 
further discussed in $37. We shall not pause to investigate here the analytical 
properties of these amplitudes (cf. QM, §128), but merely note than e”« again has, 
as a function of energy, poles at the points corresponding to the levels for bound 
states of the particle. The residue of e” at such a pole is related in a certain 
manner to the coefficient in the asymptotic expression for the corresponding wave 
function of the discrete spectrum. This relation may be found by a generalization 
of the non-relativistic formula, QM (128.17). The necessary calculations are 
entirely similar to those in QM, $128. 

Differentiation of equations (35.5) with respect to the energy gives 


(A 4 SD Ce +m - u) oe) 


OE r 0¢E "8: 
a(rg)\, _« A(rg) a(rf) _ 
a ee ag 


We multiply these two equations by rg and —rf respectively, and the two equa- 
tions (35.5) by —rg and rf, and add all four term-by-term. After simplification, we 
have 


alee) -reve 
ah Be oe) ee) 
We now integrate with respect to r: 
2 Of 8) - | 2 Do 
P(e Zt im (f° + g°)r° dr, 
0 


and take the limit as ro. The integral on the right tends to unity, by the 
normalization condition. On the left-hand side, we use the fact that in the asymp- 
totic region the functions f and g are related by 
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which is derived from (35.5) by neglecting terms in U and in 1/r. The result is 


al fy oD - (2) =e (35.10) 


This formula differs only in the coefficient (e +m replacing 2m) from the 
corresponding non-relativistic formula (for the function vy). There is therefore no 
need to repeat the subsequent calculations; the final formula, valid near the point 
€ = Eo (where € is the energy level), is 


Fs, 


2A; Mm — Eo 


215, — (_.1)! 
i Ca rg erat (35.11) 


where Avg is the coefficient in the asymptotic expression (35.9). 


PROBLEM 


Find the limiting form of the wave function for small r ina field U~r*, s <1. 

SOLUTION. For a free particle we have, when r is small, f~r', g~r', so that f >g if 1<I', and 
f <g if |>I'. We make the assumption (which is confirmed by the result) that this relationship exists 
also in the field considered here. If |< I' (i.e. 1 =J —3, k = —I1-— 1), the term in g may be omitted from the 


[+1l~s 


first equation (35.4), so that f ~ r' as before. The second equation then gives g ~ rfU, ie. g~r = 
r'-’. The case | >I’ is treated similarly. The result is 


for] <l’, f~ r', a pes 


for | >I, f~r, e~r'. 


§ 36. Motion in a Coulomb field 


We shall begin the study of the properties of the motion in the very important 
case of a Coulomb field by considering the behaviour of the wave functions at 
short distances, taking the particular case of an attractive field: U = —Za/r.t 

For small r, the terms in e€ +m may be omitted in equations (35.5), leaving 


pi Rope SO =. 
(fry +— fr eh 0, 


(gry —“ gr + fr =0 


The two functions fr and gr appear in an equivalent manner in these equations, and 
we therefore seek them as equal powers of r: fr=ar’, gr = br”. Substitution gives 


a(y + k)— bZa = 0, aZa + b(y— kx) =90 
whence 
y° =k’ —(Za)’. (36.1) 


+ In ordinary units, U = — Ze’/r. In relativistic units, e” is replaced by the dimensionless quantity a. 
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Let (Za)’< x’. Then y is real, and the positive value must be taken: the 
corresponding solution either does not diverge at r =0 or does so less rapidly than 
the other. The choice may be justified by considering a potential which is “cut off” 
(see §35) at a certain small ro and then taking the limit ry>>0; cf. the analogous 
discussion in QM, §35. Thus 


Za 1+ 
= —— 9g = constant X - 
f [eee pe 


y= V(K?— Za’) = VG +3" - Za"). 


(36.2) 


Although the wave function may become infinite at r=0 (if y < 1), the integral of 
|¥|? remains finite, of course. 

If (Za) >’, both values of y given by (36.2) are imaginary. The corresponding 
solutions oscillate as r7' cos(|y| log r) when r—0, and this again corresponds to a 
situation which is inadmissible in the relativistic theory, as already shown above. 
Since k’?>1, this means that a purely Coulomb field can be discussed in Dirac’s 
theory only if Za <1, i.e. Z < 137. 

Let us now give a qualitative description of the situation which arises when 
Z > 137. In order to avoid an indeterminacy in the boundary condition at r= 0, we 
must again consider a potential cut off at a distance ry (I. Ya. Pomeranchuk and Ya. 
A. Smorodinskii, 1945). This has a direct physical significance as well as a formal 
one. The charge Z > 137 can in practice be concentrated only into a “superheavy” 
nucleus of finite radius. Let us therefore see how the configuration of levels varies 
as Z increases for a given Io. 

In the Coulomb field when not cut off, the energy ce, of the lowest level tends to 
zero When Za = 1, and the e,(Z) curve terminates, the level ¢, becoming imaginary 
for Za > 1; see (36.10) below. In the cut-off field, for a given rm #0, the level «, 
passes through zero only for some Za>1. The value ¢,=0 has no physical 
distinctiveness, and when rn) #0 it also has no formal distinctiveness, the ¢,(Z) 
curve not terminating there. When Z increases further, the levels continue to 
descend, and at a certain “critical”? value Z = Z.(ro) the energy e, reaches the 
bottom (—m) of the lowest continuum of levels. As explained in §35, this means 
that zero energy is required for the creation of a free positron. The critical value Z, 
is therefore the maximum charge that the “bare’’ nucleus can have for a given I. 

When Z> Z,, the level €;<~—m, and the formation of two electron—positron 
pairs becomes energetically favourable. The positrons go to infinity, carrying 
kinetic energy 2(|e,]— m), and the two electrons occupy the level ¢). This gives an 
“ion”? with an occupied K shell and a charge Z.g = Z —2 (S. S. Gershtein and Ya. 
B. Zel’dovich, 1969). The system is stable for Z > Z,, up to values of Z for which 
the limit —m reaches the next level.t 

Lastly, it may be noted that, even for a point charge, the form of the potential at 
short distances is affected by radiative corrections, but the resulting corrections to 
Z.a are only of the order of a. 


+ For example, if the nuclear charge is uniformly distributed in a sphere with radius ro= 
1.2x 10°" cm, the critical value Z. = 170, and the next level reaches the limit -m when Z = 185 (V. S. 
Popov, 1970). A detailed account of the quantitative theory is in the review article by Ya. B. Zel’dovich 
and V. S. Popov, Soviet Physics Uspekhi 14, 673, 1972. 
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Let us now turn to the exact solution of the wave equation (C. G. Darwin, 1928; 
W. Gordon, 1928). 
(a) Discrete spectrum (e« <m). We seek the functions f and g in the form 


f = Vim +e) ep’ (Qi + Q)), 


| (36.3) 
g=—V(m-—e)e *p (Qi — Qy), 


with the notation 


p=2Ar, A=V(m'—-&), y=V(K?—-Z’a”). (36.4) 


This is reasonable, since we already know the behaviour of the functions as p>0 
(36.2) and their exponential decrease (~e *) as p>. Since, as p—>®%®, the func- 
tions f and g must have the same asymptotic behaviour, we must expect that 
Q,>Q, as p>. 

Substitution of (36.3) in (35.4) yields the equations 


p(Qi + Qo)’ + (y + K)(Qi + Qz) — pQr + Zar)™ 7 : (Qi ~ Qr) = 0, 
p(Qi~ Qa)’ + (y ~ K(Qi~ Q3) + PQ - Zay™*£(Q,+ Q) =O, 


where the prime denotes differentiation with respect to p. The sum and difference 
of these give 


pQi + (y - 229, + (. - =att) Q, = 0, 


pQs+ (y + fas = p )Qr+ (« +a a, = 0, 


(36.5) 


or, eliminating either Q; or Q, 


Z 
pQi + 2y +1-p)Qi- (y¥-=)ai=0, 
tr t Zae a 
pQ3+ Qy +1—p)Qi-(y+1-")a.=0, 


where we have used the fact that y?—(Zae/A)? = «’—(Zam/A). The solution of 
these equations which is finite when p = 0 is 


Qe 


Qi= AF(y- 25, 27+ 1,9), 


(36.6) 


Qe 


Q:= BF(y+1-= 27 +1), 
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where F(a, B, z) is the confluent hypergeometric function. Putting p = 0 in either of 
the equations (36.5), we obtain the relation between the constants A and B: 


_ ya Zaelr A (36.7) 


B= k-Zam/[rA 


The two hypergeometric functions in (36.6) must reduce to polynomials, since 
otherwise they would increase as e? when p>, and the wave function itself 
would increase as e”. The function F(a, B, z) reduces to a polynomial if @ is a 
negative integer or zero. We write 


y — Zae/A = —n,. (36.8) 


If n,=1,2,..., the two hypergeometric functions reduce to polynomials. If n, = 0, 
only one of them does so. In that case, y = Zae/A, and Zam/A = |x|, as is easily 
verified. If k <0, the coefficient B (36.7) 1s zero, so that Q. = 0 and the necessary 
condition is satisfied. If k >0, however, then B =—A, and Q), remains divergent 
when n, = 0. Thus the following values are possible for the quantum number n,;: 


n,=0,1,2,... for« <0, 
(36.9) 


=1,2,3,... for« >0. 


The definition (36.8) then yields the following expréssion for the discrete energy 
levels: 


a i! "Wie? GaF + aPl — 
In particular, the energy of the 1s; ground level (|x| = 1, n, = 0) is 
€,)=mV[1—(Za)’]. 
For Za <1, the leading terms in the expansion of (36.10) are 


Boge Go fn oy [1 3. 
m oe racers +r alta Marat 


a?) 


On writing n,+|«|=n (=1,2,...) and noting that |x| =j +3, we return to formula 
(34.4), which was previously derived by means of perturbation theory. As already 
mentioned at the end of $34, the further terms in this expansion have no 
significance, since they are certainly exceeded by the radiative corrections. For- 
mula (36.10) as it stands, however, is meaningful when Za~1. The double 
degeneracy of the levels shown by the approximate formula (34.4) exists also in the 
exact formula, which involves only |x|, so that levels with the same j and different | 
again coincide. 

We have still to determine the common normalization factor A in the wave 
function. The wave function of the discrete spectrum must, as usual, be normalized 
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by the condition f ||? d’x = 1; the corresponding condition on the functions f and g 
1S 


[@e+ear dr = 1. 
0 


The value of A is most simply determined from the asymptotic form of the 
functions as r-. Using the asymptotic formula 


PQy +1) 


F(-n,, 2y + 1, p) = Tan, t+2y+p° ?)" 


(see QM, (d. 14)), we find 


f~(-)"AV(m + eg tere 


a ON). 

Comparing this with (36.22) derived below, we can find A. Collecting together the 
formulae, we can write out in full the final expressions for the normalized wave 
functions: 


fg = PQy + 1)L4m(Zam/A)(Zam/A — k)- n,! (2Aryr eo" x 
x { (2am _ « JF n,, 2y +1, 2Ar)#n,F(1—n,,2y +1, 2aryt, (36.11) 


where the upper signs refer to f and the lower signs to g. 

(b) Continuous spectrum (¢ >m). There is no need to solve the wave equation 
afresh for the states of the continuous spectrum. The wave functions for this case 
are obtained from those of the discrete spectrum by the substitutionst 


V(m —&)>-iV(e —m), A\—>-—ip, —n, > y —iZaelp; (36.12) 


see QM, $128 concerning the choice of sign in the analytical continuation of the 
square root V(m — «). The normalization of the functions must, however, be done 
again. 

Making these substitutions in (36.11), we may write 


f=V(e+m) 


x [e*F (y — iv, 2y + 1, —2ipr) ¥ e *F(y +1— iv, 2y + 1, —2ipr)], 


+ In the rest of this section, p denotes |p| = V(e*— m”). 
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where A’ is another normalization constant, 


Ya 
K — ivm|e’ 


v = Zaelp, ge = (36.13) 


the value of & is real, since y* + (Zae/p)’ = x*+(Zam|p)’. 
According to the formula 


F(a, B, Z) = e’F(B — a, B, 2) 
(see QM, (d. 10)), we have 


F(y +1-—iv, 2y +1, —2ipr) = e 7”'F(y + iv, 2y + 1, 2ipr) 
=e PTF *(y — iv, 2y +1, —2ipr). 


Hence 
f,g =2iA'V(e + m)(2pr)”! im, refe!?"*® F(y — iv, 2y +1, —2ipr)}. (36.14) 


The normalization coefficient A’ is found by comparing the asymptotic expression 
for this function with the general formula (35.7) for a normalized spherical wave. The 
resulting expression for the wave functions of the continuous spectrum (which we 
shall afterwards verify) is 


_nyp_fmte 1,[(y+1+iv)| Qpry 
fig=2 J - ©  iOyai oF 

x im, re{e'?" 9 F(y — iv, 2y + 1, —2ipr)}. (36.15) 
The asymptotic expression for this function is derived by means of QM (d. 14), 


where only the first term is now significant, the second decreasing as a higher 
power of 1/r: 


f,g= Me at sin, cos(pr + 6, + v log 2pr—3l7), (36. 16) 


E 
where 
6, = €-arg (y+ 1+iv)—pry +4ln, (36.17) 
Or 


215, K — ivm/e '(y +1— iv) ointl-y) 


y—-iv T(yt+t+iv) ree) 


€ 


For future reference, we shall give an expression for the phase in the ultra- 


+ The wave functions for a repulsive field are obtained by changing the sign of Za, i.e. that of v. 
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relativistic case (ec > mM, v = Za): 


215, K I'(y + 1— iZa) einl-y) 
y —iZa T(y + 1+ iZa) 


e (36.19) 


The expression (36.16) differs from (35.8) only by the logarithmic term in the 
argument of the trigonometric function. As in Schrodinger’s equation, the slowness 
of the decrease of the Coulomb potential affects the phase of the wave, which 
becomes a slowly varying function of r. 

Analytical continuation into the region « <_m gives, in place of (36.18), 


rs, _K —ZamlA (y+ 1—ZaelA) | igg—y 
oe y —Zae/A T(y +1+ Zae/dr) Coe) 
This expression has poles at the points where y+ 1— Zae/A =1—n,, n, =1,2,... 
(poles of the gamma function in the numerator), and also at the point y — Zae = 
—n, = 0 (if also x <0); these points coincide with the discrete energy levels, as they 
should. 
Near any of the poles with n,# 0, we have 


BOS cu (Zam/d a K) ge) 


ahOy+ tS I(y + 1- Zae/A). 


€ 


The form of the gamma function near its pole is found by means of the familiar 
formula ['(z)l(1 — z) = a/sin az: 


ZQE aT 
(y+ r eas sin (y+ 1— Zae/A)’ 
sin 7(y + 1— Zae/A) = 7 COS TH, ° <-(=2) ‘(€ — Eo) 


= (-1)"(mZam’/A°)(e — €0), 
where €é9 is the energy level. Thus we havet 


e'(Zaml/A—K) A? 1 
n,'T(2y+1+n,) Zam? € — €9 


ei. =~ (J) (36.21) 


At the end of §35 a formula (35.11) was derived which relates the residue of the 
function e”* at its pole to the coefficient in the asymptotic expression for the wave 
function of the corresponding bound state. For a Coulomb field, however, the 
formula (35.10) must be slightly modified, because the constant phase shift 6, in 
(35.7) is replaced in (36.16) by the sum 6, + v log2pr. We must therefore replace 
e**« on the left-hand side of (35.11) by 


exp(2i6, + 2iv log 2pr) > e7*«(2iAr) 7". 


+ This formula is easily seen to be valid even if n, = 0. 
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Using (36.21) and determining from (35.11) the coefficient Ap (which will now be a 
power function of r), we find the asymptotic form of the normalized wave function 
of the discrete spectrum: 


5 eo 
Jean eae (36.22) 


Pe (Zam/X —«)(m + €)A? 
2n,'ZamT(2y +1+N,) 


This has already been used to determine the coefficient in (36.11). 


§37. Scattering in a centrally symmetric field 


The asymptotic expression for the wave function which describes the scattering 
of particles in the field of a fixed centre of force may be writtent 


ipr 


= Usp eC”? + ly = API) 


— 
Here u,, is the bispinor amplitude of the incident plane wave. The bispinor uéy is a 
function of the direction of scattering n’, and for any given value of n’ its form (but 
not, of course, its normalization) is the same as that of the bispinor amplitude of 
the plane wave propagated in the direction n’. 

We have seen in §24 that the bispinor amplitude of the plane wave is entirely 
determined by specifying a two-component quantity, the three-dimensional spinor w, 
which is the non-relativistic wave function in the rest frame of the particle. The flux 
density is expressed in terms of the same spinor, and is proportional to w*w, witha 
proportionality coefficient which depends only on the energy e and is therefore the 
same for the incident and scattered particles. The scattering cross-section is 
da = (w'*w'/w*w) do or, if as in §24 the incident wave is normalized by the condition 
w*w = 1, 


da = w'*w’ do. 
We define the scattering operator f by 
w'= fw. (37.2) 


Since the quantities w, w’ have two components, the operator thus defined is 
exactly analogous to the operator scattering amplitude which appears in the 
non-relativistic scattering theory taking account of spin (QM, 8140). We can 
therefore apply immediately the formulae derived there which express the operator 
in terms of the phase shifts of the wave functions in the scattering field. It is only 
necessary to transform these phase shifts by expressing 5; and 6; from QM, §140, 
in terms of the phase shift 6, which appears in the relativistic formula (35.7). The 
phases 6; and 6; referred to states with orbital angular momentum | and total 


+ In §§37 and 38 p denotes |p|, and « and p will be written separately as suffixes to the amplitude. 
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angular momentum j =1+3 and j =1—3. According to the definition (35.3), « = 
—l- 1 for j =1+3 and «x =1 for j =1—34. We must therefore make the changes 


8/ > 6_a+p; 6; > 6 


and remember that the suffix to 6 now represents the value of x. Thus we find 


f=A+Bv-a, (37.3) 

A= sip [(1 + 1)(e7®--! — 1) + 1(e2® — 1) ]P,(cos 8), (37.4) 
1=0 

B= 5p p> (e7-1 — @7) Pl(cos 8), (37.5) 


where v is a unit vector in the direction of n Xn’. 

Since w is the spinor wave function in the rest frame, the polarization proper- 
ties of the scattering are given in terms of f by the same formulae as in QM, $140. 

For a Coulomb field, it is possible to express both functions A(6@) and B(@) in 
terms of one function. The calculation is briefly as follows.t 

In a Coulomb field, the phases 6, are given by (36.18), which we write in the 
form 


Dp K 
or | (37.6) 
— FAY TW) og in(\x\-y). 
Ce Re ae 
e'™ = e'™ when x >0, and e’” = —e'™ when x <0. Using the quantities thus defined 
we can put the series (37.4), (37.5) in the form 
2 
A(6) == G(6)— i 2" F(O), 
| (37.7) 
B(@) = ~ 5 tan 30 - G(@)+ — cot 4 - F(6), 
where 
G(6) =3i p> CP; + Pi-1), F(6) =3i 2, IC\(P; — Pi-1). (37.8) 


In transforming the series B(6), we have used the following recurrence relations 
between Legendre polynomials: 


Pi+PL,=-—cot3@-I(P, — P.-,), (37.9) 
Pi—PlL,=tan30-I(P,+ P,_,). (37.10) 


+ R. L. Gluckstern and S. R. Lin, Journal of Mathematical Physics 5, 1594, 1964. 


QE4-K 
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According to the identity 


(1+ cos @) ohn [Pi(cos 6) — P,_\(cos 6)] = I[P;(cos @) + P,_;(cos 6)], 
(37.11) 
the functions F(@) and G(@) are related by 
nso Coy ising OE, 
G =(1-—cos ea cots 4 Th (37.12) 


Thus A(@) and B(@) are expressed in terms of the single function F(6).t 


§ 38. Scattering in the ultra-relativistic case 


We shall now discuss separately the scattering in the ultra-relativistic case 
(e >m). In the first approximation, we neglect altogether the mass m in the wave 
equation. It is convenient to use for & the spinor representation 


since the equations for € and y are separable when m = 0: 


—lo -VE=(e— U)E, 
(38.1) 


—lo:Vyn=—(e-—U)y 


(the ‘“‘neutrino”’ form, §30). 
A helicity state of an electron polarized in the direction of p corresponds to 


a wave function & = 6) and for polarization opposite to p we have w = ie) 


0 


Since the equations for € and y are separable, it is evident that this property is 
unaffected by scattering. Thus helicity is conserved in the scattering of ultra- 
relativistic electrons. From considerations of symmetry (longitudinal polarization) 
it is obvious that there is no azimuthal asymmetry in the scattering of helical 
(longitudinally polarized) particles. We can also say that the scattering cross- 
section of helical electrons is independent of the sign of the helicity; this follows 
because a central field is invariant under inversion, while the sign of the helicity ts 
reversed. 

In the ultra-relativistic case, formulae (37.3)-(37.5) may be considerably sim- 
plified (D. R. Yennie, D. G. Ravenhall and R. N. Wilson, 1954). 


+ The function F(@) cannot be expressed in a closed form in terms of the elementary functions, but 
it can be written as a certain double integral; see the paper cited in the last footnote. 
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Let the incident electron be polarized, say in the direction of motion n. For a 
plane wave with a definite value of n-o, the spinor é (=(¢ + x)/V2) is propor- 
tional to the same three-dimensional spinor w as appeared in the standard 
representation of the wave. The relation between the spinor amplitudes of the 
incident and scattered waves in the new representation is therefore given by the 
Same operator f. 

As a result of the scattering, the polarization is rotated with the momentum to 
the direction n’. The effect of the operator f on the spin wave function of the 
electron therefore reduces to a rotation of the spin through the angle @ (between n 
and n’) about the axis v. This rotation is itself equivalent to a rotation of the 
coordinates about that axis but in the opposite direction, i.e. through an angle — @. 
Hence it follows that the operator f must be the same (apart from a factor) as the 
operator which transforms the wave function when the coordinates are changed in 
the way described, 1.e. the operator (18.17) with —@ instead of 6. A comparison of 
(37.3) with (18.17) shows that 


B/A = —i tan30. (38.2) 

Thus, in the ultra-relativistic limit, 
f = A(@)[1—itan3@-v-o]. (38.3) 
The expression (37.4) for A(@) can also be simplified if a relation between 6, 
and 6_, which exists in the ultra-relativistic limit is used. To derive this relation, we 


note that, when the terms in m are omitted, the equations (35.4) for the functions f 
and g become invariant with respect to the changes 


K—>—K, f-g, g->-f, 


which do not affect the parameters of the particle or field itself. We must therefore 
have f,/2, = —g_,./f-,, and substitution of the asymptotic expressions gives 


tan(pr —3la + 6,.) = —cot(pr —31'm + &_,), 
6, = 6, —(l'— Da +(n +3), 
whence 


e 7b. = e 7b. (38.4) 


From this relation, and replacing the summation variable | by | — 1 in the first term 
of the sum in (37.4), we find 


A(@) = im S I(e* — 1)][P)(cos 6) + P)-,(cos 6)]. (38.5) 
[=1 


From (38.2) it follows that re(AB*)=0. Hence, in the approximation con- 
sidered, the cross-section is independent of the initial polarization of the particles, 
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and an unpolarized beam remains unpolarized after scattering (see QM, (140.8)- 
(140.10)). We may also note that, when @ > 7, the expression (38.5) for A(@) tends 
to zero as (7 — 9) (since P,(—1) = (—1)'). The cross-section 


: = |AP +|BP =|A(@)P/cos? 36 (38.6) 


therefore tends to zero also. These properties do not occur, of course, in higher 
approximations with respect to the small quantity m/e. In particular, analysis shows 
that as 6 > 7 the cross-section tends to a limit proportional to (m/e)’. 

For a Coulomb field in the ultra-relativistic case, the phases 6, are independent 
of the energy, as is seen from (36.19).f Hence, in a purely Coulomb field, the 
scattering cross-section for « >m has the form 


2 Oe, (38.7) 


E 


where 7 1s a function of the angle only. 


§39. The continuous-spectrum wave functions for 
scattering in a Coulomb field 


In later sections (§§95, 96) we shall consider various inelastic processes which 
occur when ultra-relativistic electrons are scattered in the field of a heavy nucleus 
(Za ~ 1). To calculate the relevant matrix elements, we need wave functions whose 
asymptotic form (as r->©) is the sum of a plane wave and a spherical wave. 

We shall see that, in the ultra-relativistic case (electron energy ¢ > m), the most 
significant values of the momentum transfer from electron to nucleus in scattering 
are q = |p’— p|~ m. These values of q correspond to impact parameters p ~ 1/q ~ 
1/m, the electron being deflected through angles 


6~q/p~mle. (39.1) 


In terms of the coordinates r (distance from the centre) and z=rcos 6, this 
represents the region 


p =r sin 6 ~ 1/m, p(r—z)=pr(i-—cos 6)~ 1, (39.2) 


and r~ e/m’, so that the distances concerned are large. 
We write Dirac’s equation in the form 


(ce -U-mB+ia-VYsb=0, U=-—Zalr, (39.3) 


and transform it into a second-order equation by applying the operator «— U + 


+ This is also evident directly from equations (38.1), since for a Coulomb field the energy ¢ may be 
eliminated from the equations by the substitution r-r’/e. 
£ In this section, p denotes |p]. 
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mB —ia-V: 
(A + p*—2eU)b = (—ia: VU — UU’). (39.4) 


Since r>Za/e in the region considered, U <e. As a first approximation, the 
right-hand side of (39.4) may be neglected. The remaining equation, 


(A + p?+2eZa/r)b = 0, (39.5) 


is of the same form as the non-relativistic Schrodinger’s equation in a Coulomb 
field: 
Za\, _ 
(5 A+ p22), =0, (39.5a) 


differing only in an obvious change in the notation for the parameters (the 
“potential energy” containing an extra factor e/m). We can therefore write down 
immediately the solution which has the required asymptotic form (see QM, $136). 

For example, the wave function which asymptotically comprises a plane wave 
(x e'?'') and an outgoing spherical wave is 


WQ=C TBS e e iP: "F(<22 =,1, i(pr— pr) 


(39.6) 
C = eT (1 — iZae/p), 


where F is the confluent hypergeometric function and u,, the constant bispinor 
amplitude of the plane wave, normalized by the condition stated earlier (23.4): 


UepUcp = 2M. (39.7) 
The wave function (39.6) is normalized in such a way that the plane wave in its 
asymptotic limit has the usual form, 
u 


Veo" 


corresponding to “‘one particle in unit volume’’. Since p ~ « in the ultra-relativistic 
case, we can write Zae/p ~ Za in (39.6): 


yO = CWEa* e'?'"F(iZa, 1, i(pr—p-r)), 


(39.8) 
C = e”""T(1 — iZa). 


It should be noted that, although we are considering distances so large that 
pr > 1, the hypergeometric function in (39.8) cannot be replaced by its asymptotic 
form: the argument of F is not pr but pr(1—cos 6), which is not assumed large.t 


+ In QM, 8135, we were concerned with arbitrarily large r, and this approximation was therefore 
allowable for all values of 6. 
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In applications, the next approximation for w is also needed, which has a spinor 
structure different from (39.8) (the latter reducing to the factor u,,). To calculate 
this approximation, we write w in the form 


C ip-r 

Y= 76) °° (UepF + @). 
On the right-hand side of (39.4) we now retain the term linear in U, obtaining for ¢ 
the equation 

(A +2ip:-V—2eU)d = —iu,,a- VU. (39.9) 

The solution of this may be found by noticing that the function F satisfies the 
equation 

(A +2ip:-V—2ceU)F =0, 
as may be seen by substituting (39.6) in (39.5). Applying the operator V to this 
equation, we obtain 


(A +2ip- V—2eU)VF =2eF VU. 


A comparison with (39.9) shows that 
ee reo 
b= - (a> V)UcpF. 


The final expressions for # and for a similar function # whose asymptotic 
form contains an ingoing spherical wave are 


C } I 
‘Co ip«r sori canon : : ° ee 
oh (De) ° (1 5, V)F (Za, 1, (pr P°r))Ucp, 


* < 
po = oy er" oe Qa: V)F(-iZa, 1,—i(prt+p-:-r))up, 


C= e"™"T(1 — iZa) 


(39.10) 


(W. H. Furry, 1934). We shall also write out the corresponding functions (wW_,,-») 
with “negative frequency”, which are needed when dealing with processes which 
involve positrons. These can be derived from the functions w,, by the substitutions 
p-—p, ¢>~—«, with p = |p| unchanged; the parameter iZa of the hypergeometric 
function therefore changes sign, as will be seen from the original expression (39.6), 
where this parameter occurs in the form iZae/p. Thus we have 


i 


C ; 
(+) — —ip-r : V — : + 7 
We oy (2¢) € (1 Ve Qa )F¢ iZa, 1, i(pr p Cusp 


a (39.11) 


: Co . 1% 
( ) —_— ip:r . ° —_ e a . 
Wie (Qe) e€ (1 + 5, V)F(iZa, Lr ph) ts 


C=e "11+ iZa). 
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The following comment is necessary regarding the above calculations. Our 
asymptotic condition is not in itself sufficient to provide a unique choice of the 
solution of the wave equation; this is clear, since we can always add to wW any 
outgoing Coulomb spherical wave without violating the condition. By writing the 
solution of equation (39.5) in the form (39.6), we have tacitly presupposed the 
choice of a solution finite at r= 0. This requirement was necessary in QM, §§135, 
136, where we were considering solutions, valid in all space, of the exact 
Schrédinger’s equation.f In the present case, however, equation (39.5) applies only 
to large distances, and therefore the choice of solution demands further 
justification. 

This is provided by the fact that large impact parameters p = r sin 8 correspond 
to large orbital angular momenta | and small scattering angles 6: when p ~ I/m, we 
have 


l~ pp ~ pe~ e/m>1, 
and the angle 6 may be estimated by a quasi-classical procedure: 


gi fata, Uwe m2, 
pJ dr p € 


Thus, in the expansion of w& in terms of spherical waves the main contribution (in 
this range of r and @) will come from waves with these large values of |. But a 
spherical wave with large | will certainly decrease to small values at distances from 
the origin r <I/e which are “classically inaccessible’ (because of the centrifugal 
barrier). Hence, if we “join” the solution of equation (39.5) to that of the exact 
equation (39.4) at short distances r~r,, where l/c >1r,; > Za/e, then the boundary 
condition for the solution of equation (39.5) will be that it is small, and this justifies 
our choice. 


PROBLEM 


For an attractive Coulomb field with Za <1, find the correction (of relative order Za) to the 
non-relativistic wave function of the discrete spectrum. 

SOLUTION. The electron velocity in a bound state is v ~ Za, and therefore, for Za «<1, the wave 
function is non-relativistic in the zero-order approximation, i.e. 


yp = UWnon-r, 


where Wnon-+ 18 the Schrédinger function and u a bispinor of the form 


with w a spinor describing the polarization state of the electron. In the next approximation, we write 
ib = Ufnons + Ww? and, substituting this in (39.4), obtain for yw” the equation 


ITAL La 0215 ( *)° 
(= A léa| > )y =F ie es (aU) Wnon-t; 


+ In the method of solution given in QM, $135, this condition was satisfied by taking the particular 
integral in the form (135.1) instead of as a general sum of integrals with different values of B; and Bo. 
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where e, is the non-relativistic discrete energy level. Here we have omitted terms of relative order 
~(Za)’; in the non-relativistic case, the important distances are of the order of the Bohr radius, 
r~1/mZa. The solution of this equation is wp” = — (i/2m)au - VYnon-+, and therefore 


y= (1 ee Qa: V Jhon 


2m 


§40. An electron in the field of an electromagnetic plane 
Wave 


Dirac’s equation can be solved exactly for an electron moving in the field of an 
electromagnetic plane wave (D. M. Volkov, 1937). 

The field of a plane wave with wave 4-vector k (k*=0) depends on the 
4-coordinates only in the combination ¢ = kx, so that the 4-potential is 


A* = A*(d), (40.1) 
and satisfies the Lorentz gauge condition 
0,A* =k,A*'=0, 


the prime denoting differentiation with respect to @. Since the constant term in A is 
unimportant, we can omit the prime, writing the condition as 


kA =0. (40.2) 
We start from the second-order equation (32.6), in which the field tensor is 
F,, =k, A, — kA. (40.3) 


When expanding the square (id — eA)’ it must be remembered that, from (40.2), 
d,(A*p) = A*d,.w. The result is 


[—0?— 2ie(Ad) + e*A’— m? — ie(yk)(yA')|b = 0, (40.4) 


where 0° = 4,0". 
We seek a solution of this equation in the form 


p=e *F(d), (40.5) 


where p is a constant 4-vector. This form of the function w is unaltered by adding 
to p any constant multiple of the vector k, if the function F(¢) is appropriately 
redefined. We can therefore, without loss of generality, impose one further con- 
dition on p. Let 


p’=m’. (40.6) 


Then, when the field is removed, the quantum numbers p” become the components 
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of the free particle 4-momentum. The significance of the components of the 
4-vector p, when the field is present, is more clearly seen in a particular frame of 
reference chosen so that Ay = 0. Let the vector A in this frame be along the x'-axis 
and k along the x°*-axis; the electric field of the wave is then along x', the magnetic 
field along x’, and the wave itself is propagated along x°*. Then (40.5) will be an 
eigenfunction of the operators 


: . Oo 3 . Oo P 2 .[ 0 ey) 
Pi=iggn  Pa=tay Po Ps il5,0~ 5,3) 


with eigenvalues pj, Po, Po—p3; the operators themselves are easily seen to 
commute with the Hamiltonian of Dirac’s equation. Thus, in this frame of 
reference, p' and p” are the components of the generalized momentum along the x’ 
and x* axes; p’—p?’ is the difference between the total energy and the x’- 
component of the generalized momentum. 

In substituting (40.5) in (40.4), we note that 


a“F=k"F', 0,0"F =k’?F"=0, 
and obtain for F(@) the equation 
2i(kp)F' +[—2e(pA) + e?A? — ie(yk)(yA")|F = 0. 


The integral of this equation is 
kx 


F = exp|-i | py 4) roe | ee eo 


where u/\V(2po) is an arbitrary constant bispinor; the reason for writing it in this 
form will be shown below. 
All powers of (yk)(yA) above the first are zero, since 


(yk )(yA)(yk)(yA) = —(yk)(yk)(yA)(yA) + 2(KA)(yk)(yA) = —k?A? = 0. 
We can therefore write 


e(yk (yA) _ 
2(kp) xp) 


so that w becomes 


u is 
bp = J+ ep) TOA] apy & (40.7) 
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kx 


ee ae eee ee 
S =-—px [ lye spy 4” | dd. (40.8) 


0 


To determine the conditions to be imposed on the constant bispinor u, we must 
suppose that the wave is “‘switched on” with infinite slowness, starting from 
t=—«, Then AO when kx >~—, and Ww must become the solution of the free 
Dirac’s equation. Consequently, u = u(p) must satisfy 


(yp —m)u = 0. (40.9) 


This condition rejects the “redundant” solutions of the second-order equation. 
Since u is independent of time, the condition remains valid for finite kx. Thus u(p) 
is the same as the bispinor amplitude of the free plane wave; we shall take it to be 
normalized by the same condition (23.4): mu = 2m. 

The foregoing arguments also show immediately the normalization of the wave 
functions (40.7). The infinitely slow application of the field does not alter the 
normalization integral. Hence it follows that the functions (40.7) satisfy the same 
normalization condition, 


| Usp, d°x = | byy Ww, d’x = (27)*5(p'— p), (40.10) 


as the free plane waves. 
Let us calculate the current density corresponding to the functions (40.7), first 
noting that 


ee ae is 
by = Toap| tae) OAC Je 


and hence obtaining by direct multiplication 


ee eee al a L b e(pA)_ eA’ 
1” = yh | eA" +k ea dep) (40.11) 


If the A’(@) are periodic functions, and their time-average value is zero, the mean 
value of the current density is 


mae 1 ( e° are 
e=—[p* — Ark \. 40.12 
PMP’ 30K) CoH) 


We can also find the kinetic momentum density in the state w,. The kinetic 


+ This S is the same as the classical action for a particle moving in the field of a wave; cf. Fields, 
$47, Problem 2. 
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momentum operator is the difference p —-eA =iad—eA. A direct calculation gives 


Wp (p* 7 eA" hp = Upy (p" eA”), 


A) eA’ ie 
— nk — An + ke (OP _ Ke F,, * AV : 
ae (kp) 2(kp)) * * 8(kpypo "7 
(40.13) 
The time-average value of this 4-vector, denoted by gq", is 
eA’ 
qo =p — (kp) k*, (40.14) 
Its square 1s 
2 2 e° -A2 
qa=-m,, my, =m ( -2), (40.15) 


where m,, acts as an “effective mass” of the electron in the field. A comparison of 
(40.14) with (40.12) shows that 


j* =q"Ipo. (40.16) 


The normalization condition (40.10), expressed in terms of the vector q, is 
[ veux = ny 2 aq’ -q: (40.17 
0 


this is most simply proved in the particular frame of reference mentioned above. 


§41. Motion of spin in an external field 


The quasi-classical approximation in Dirac’s equation is reached in the same 
way as in the non-relativistic theory. In the second-order equation (32.7a) we 
substitutet 

w=u eS: 
where S is a scalar and u a slowly varying bispinor. The usual condition of the 
quasi-classical case is assumed to be satisfied: the momentum of the particle must 
vary only slightly over distances of the order of the wavelength h/|pl. 

In the zero-order approximation with respect to h, we obtain the usual classi- 
cal relativistic Hamilton-Jacobi equation for the action S. All the terms which 
contain the spin (and are proportional to h) are absent from the equations of 
motion. The spin would appear only in the next approximation with respect to h. 


+ Ordinary units will be used at first. 


152 Particles in an External Field 841 


Thus the influence of the magnetic moment of the electron on its motion is always 
of the same order of magnitude as the quantum corrections. This is to be expected, 
since the spin is a purely quantum property and its magnitude is proportional to h. 

We can therefore reasonably formulate the question of how the electron spin 
will behave when the electron is executing a given quasi-classical motion in an 
external field. The answer to this question is contained in the next approximation 
with respect to h in Dirac’s equation. We shall, however, use another method 
whose significance is more obvious and which does not directly involve Dirac’s 
equation. It has the advantage of allowing a treatment of the motion of any particle, 
including a particle which has an “‘anomalous”’ gyromagnetic ratio not describable 
by Dirac’s equation. 

The objective is to derive an “equation of motion” for the spin when the 
particle moves in any (given) manner. Let us first take the non-relativistic case. 

The non-relativistic Hamiltonian of a particle in an external field is 


H =H'-ypo-H, (41.1) 
where H’ includes all terms independent of the spin (see QM, §111), and p is the 
magnetic moment of the particle. This form of the Hamiltonian relates to any kind 
of particle. For electrons, 4 = eh/2mc (the electron charge being e = —|e|), and for 
nucleons p also contains the “anomalous” partt 


pw = wp — eh/2mce. (41.2) 


According to the general rules of quantum mechanics, the operator equation of 
motion for the spin is obtained from the formula 


(H§—§H) =~ (Ho-—oB). (41.3) 
Substitution of (41.1) gives 
Goan 1 Hy. (0x6; — 0j0x) 
=a Cin Hon, 
or 


g= 2H sxH. (41.4) 


We average this operator equation over the state of the quasi-classical wave 
packet moving in a given path. This 1s equivalent to replacing the spin operator by 


+ When radiative corrections are taken into account the magnetic moment of the electron also 
contains a very small “anomalous” part. 
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its mean value s, and the vector H by the function H(t), which represents the 
change in the magnetic field at the position of the particle (or wave packet) as the 
latter moves along its path. In the non-relativistic approximation (i.e. in terms of 
Pauli’s equation), §=30 is the spin operator of the particle in its rest frame, whose 
mean value was denoted in §29 by 3¢. Thus we obtain the equation 


df _ 2u 
= 4 bX HO). (41.5) 


This form of the equation is, in essence purely classical. It signifies that the 
magnetic moment vector precesses about the direction of the field with angular 
velocity —2uH/h, remaining constant in magnitude.t 

Again in the non-relativistic case, the velocity v of the particle varies in 
accordance with the equation 


dv/dt = ev x H/mc, 


i.e. the vector v rotates about the direction of H with angular velocity —eH/mce. If 
u'=0, then wp =eh/2mc, and this angular velocity is the same as the angular 
velocity —2uH/h with which the vector ¢ rotates; thus the polarization vector is at 
a constant angle to the direction of motion. We shall see below that this result 
remains valid in the relativistic case. 

Let us now proceed to the relativistic generalization of equation (41.5). For a 
covariant description of the polarization it is necessary to use the 4-vector a 
defined in §29, and the equation of motion of the spin will determine its derivative 
with respect to the proper time 7.4 

The form of this equation is given by considerations of relativistic invariance: 
its right-hand side must be linear and homogeneous in the electromagnetic field 
tensor F"” and in the 4-vector a”, and apart from these can include only the 
4-velocity u“ = p"/m. The only form of equation satisfying these conditions is 


da"/dt = aF“’a, + Bu" F”u,a,, (41.6) 


Where a and B are constant coefficients. It is easily seen, from the condition 
a,u”" =0 and the antisymmetry of the tensor F"” (whence F*’u,u, =0), that no 
other expressions with the required properties can be constructed. 

As v0, this equation must become the same as (41.5). Putting a” = (0,0), 
u“ =(1, 0), 7=t, we have 


dt/dt = af XH. 


A comparison with (41.5) shows that a = 2p. 
+ The classical equation (41.5) can be derived directly from the equation 
dM/dt = pH, 
where M is the angular momentum and p the magnetic moment of the system; p X H is the torque acting 


on the system. Putting M = rh, p = (u/2s)6 = wl, we have (41.5). 
+ From here onwards we again take c = 1, h = 1. 
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To determine 8, we use the fact that a“u, =0. Differentiating this with respect 
to 7 and using the classical equation of motion of a charge in a field, 


m du"/dt = eF*’u,, 
(see Fields, 823), we obtain 


da" _ du" _ 
Mu dt On dt 


e e 
= pV = pv 
GPU = = Pays 


Hence, on multiplying both sides of equation (41.6) by u,, using the equation 
u,u” = 1 and cancelling the common factor F“’u,a,, we have 


Thus the final relativistic equation of motion for the spin is 


ao = Fa, —2p'u*F"u,a, (41.7) 
(V. Bargmann, L. Michel and V. L. Telegdi, 1959). 

We can change from the 4-vector a to the quantity ¢ which directly represents 
the polarization of the particle in its “instantaneous” rest frame. The relation 
between a and € is given by formulae (29.7)-(29.9). First of all, from (41.7) we 
necessarily have a,da"/drt =0, and therefore a,a“ =constant. Since a,a” = —-¢ 
this is equivalent to the obvious result that the polarization ¢ of the particle remains 
unchanged in magnitude during its motion. 

The equation which shows the change in direction of the polarization is 
obtained by using three-dimensional notation in (41.7). The space components of 
this equation are, in explicit form, 


da_ 2um 2um 
di omens Wo | earn (a 


a BE) +2h 


= v(y- a X H)+ 


,2ple 


v(a-v)(v- E). 


Here we must substitute (29.9), using in the differentiation the equations p= ev, 
e* = p’+m’, and the equations of motion 


dp/dt = eE+ ev XH, de/dt = ev-E. (41.8) 


A lengthy but elementary calculation leads to the result 


+ This equation was first derived, in another form, by Ya. I. Frenkel’ (1926). 
+ If the gyromagnetic ratio (Landé factor) g is used (as is often done) for charged particles, with 
= g(e/2m)- 3 (= g(e/2mc) - xh), this equation becomes 


a. mal 2+ 2 Ex H+ o(9 2) 


Sv Hy xb +5 ( = \ex (Ex). (41.9a) 
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df_ 2pm t+ 2p"(e — Mex y+ Be Ey. By eg ce 


dt E etm a al 


(41.9) 


The variation of the direction of polarization relative to the direction of motion 
is of more interest than the variation of its absolute position in space. We write 


C=ngit+%, (41.10) 


where n=v/v, and derive the equation for the component ¢ of the polarization in 
the direction of motion. A calculation using (41.8), (41.9) leads to the resultt 


18 oy 't, xn (ME ‘Yeu: -E. (41.11) 


The problems at the end of this section include a number of examples of the 
application of the above formulae. Here it may be noted that, in motion in a purely 
magnetic field, the polarization of a particle having no anomalous magnetic moment 
is at a constant angle to the velocity (¢;=constant). Thus this result, already 
mentioned previously for the non-relativistic case, is in fact a general one. 

The conditions for the above formulae to be applicable can be stated more 
precisely. The requirement specified initially, that the momentum of the particle 
should vary sufficiently slowly, is equivalent to a certain condition that the fields E 
and H should be small; in particular, the Larmor radius in the magnetic field 
(~ p/eH) must be large compared with the wavelength of the particle. There is also, 
however, another condition which must, strictly speaking, be fulfilled: the fields 
must not vary too rapidly in space, and must vary only slightly within the 
dimensions of the quasi-classical wave packet. That is, the field must vary only 
slightly over distances of the order of the particle wavelength (1/p) and of the 
Compton wavelength (1/m).+ 

In practical problems of motion in macroscopic fields, however, the condition of 
slow variation is certainly satisfied, and only the condition of smallness remains. 

In $33 we have derived the first relativistic corrections for the Hamiltonian of 
an electron moving in an external field. For an electron in an electric field the 
approximate Hamiltonian is (see (33.12)) 


4 _ Wr & eo Rxe a 
H=H'-7~o-Explm, p=—i, (41.12) 


where H’ includes the terms which do not contain the spin. In our case, since the 


+ This equation can be obtained a little more directly by writing explicitly the time component of 
equation (41.7). 

+ The latter requirement arises from the condition that the spread of velocities in the wave packet, in 
its rest frame, must be small compared with c, since otherwise the non-relativistic formulae could not be 
applied in this frame. 

If the field varies too rapidly, the equations may contain significant additional terms in the 
derivatives of the field with respect to the coordinates. 
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field varies slowly, we neglect the term in H’ which involves derivatives of E (i.e. 
the term in div E); the small term in p* may also be omitted, since it is unrelated to 
the field effects in question here. Thus H’, in the absence of a magnetic field, 
reduces to the non-relativistic Hamiltonian H' = p’/2m + e®. 

Formula (41.12) can also be derived from (41.9) without making direct use of 
Dirac’s equation. This method will generalize it (in the quasi-classical case) to 
particles with anomalous magnetic moment. 

The equation of motion of the spin in an electric field, as far as first-order terms 
in the velocity v, is obtained from (41.9) as 


P= (w+ WIEX (EX v) = (52 + 2p" Ex EX). 


If we impose the condition that this equation should be derived quantum- 
mechanically by commuting the spin operator with the Hamiltonian (as in (41.3)), 
then it is easily seen that we must put 


A 


A= AH (wit ee 


° xT 

in)e E X p/m. (41.13) 
This is the required expression. If w'=0, we again obtain (41.12). It should be 
noted that the ‘“normal’’ magnetic moment e/2m is multiplied by an extra factor sin 
comparison with the anomalous moment p’.t 


PROBLEMS 


PROBLEM I. Determine the change of the direction of polarization of a particle when it moves in a 
plane perpendicular to a uniform magnetic field (v.H). 


SOLUTION. The right-hand side of equation (41.9) is reduced to its first term, and the vector ¢ 
therefore precesses about the direction of H (the z-axis) with angular velocity 


2m + 2p(e — 
~ som See =~ (£4 2u')M. 


The projection of ¢ on the xy-plane (denoted by ¢;) rotates in that plane with the same angular velocity. 
The vector v rotates in that plane with angular velocity —eH/e, as can be seen from the equation of 
motion p = ev = ev XH. Hence ¢ rotates with angular velocity —2w’H relative to the direction of v. 


PROBLEM 2. The same as Problem 1, but for motion parallel to the magnetic field. 
SOLUTION. When v and H are in the same direction, equation (41.9) reduces to 


dt_2um 
dt ¢ ae 


so that € precesses about the common direction of v and H with angular velocity —2umH/c. 


PROBLEM 3. The same as Problem 1, but for motion in a uniform electric field. 


SOLUTION. Let the field E be along the x-axis, and let the motion be in the xy-plane (with 
py = constant). According to (41.9), the vector ¢ precesses about the z-axis with instantaneous angular 


{+ This is the ‘““Thomas half’? mentioned in the last footnote to §33. Its origin is clearly shown by 
the derivation given here. 
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velocity 


€ 1\ Py 
_ + ne 
(- 2 JE 


We again resolve € into components ¢ (in the xy-plane) and ¢.. Then 
Ci = ci cos @, C.-E=—-f) sing: v/v. 


From (41.11), : rotates relative to the direction of v with instantaneous angular velocity 


-  2vy/pm> fem 
$23 (tn) BCR) 


§42. Neutron scattering in an electric field 


In collisions between neutrons and nuclei, the scattering through large angles is 
determined by the main interaction, the nuclear forces. In small-angle scattering, 
however, it can be shown that the interaction of the magnetic moment of the 
neutron with the electric field of the nucleus becomes important (J. Schwinger, 
1948). 

We shall assume that the neutron is non-relativistic, so that the interaction in 
question is described by the approximate Hamiltonian (41.13). The magnetic 
moment of an electrically neutral particle is wholly ‘“‘anomalous”’ and the operator 
H’ reduces in this case to the kinetic-energy operator:t 


h? 


A=-s A+iMg-exy. (42.1) 
m mec 


Since the electromagnetic interaction of the neutron is small, the corresponding 
scattering amplitude f.,, may be calculated by the Born approximation: 


ee Le —ip'er/h . ph : ip-r/h 73 
ies sa |e (i tte ExV)e d°’x 


(see QM, §126), or 


fen = soe EyXp,  Ey= i E(r) e-"" dx, (42.2) 
where p and p’ are the neutron momenta before and after scattering, and hq = 
p’—p. In this form, the amplitude f., is an operator with respect to the spin 
variable. 

Before continuing the calculation, we should note the following point. Formula 
(42.1) has been derived in 841 for slowly varying fields (which in practice meant 
neglecting terms in the Hamiltonian containing coordinate derivatives of the field). 
As applied to the Coulomb field of the nucleus, this means that the wavelength h/p 


+ In this section, ordinary units are used, and m denotes the mass of the neutron. 


QE4-L 
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must be small compared with the distances r~1/q which are important in the 
integral E,. Hence h,<p, so that the scattering angle 6 ~ hq/p <1. Thus the 
required condition is in fact satisfied for small-angle scattering. 
For a Coulomb field with potential ® = Ze/r, the Fourier component of the field 
is 
By = ~iqh, = iq =": 


see Fields, (51.5). Substitution in (42.2) gives 


. 2Ze€ ; 
fem =i pT DX" 


For small scattering angles, hq ~ p@ and px p' ~ p’@v, where »p is a unit vector in 
the direction of p X p’. Thus 


_ .2LZep 
tea Dhic Ov. 


The nuclear scattering amplitude must be added to this expression. Owing to the 
rapid decrease of the nuclear forces with increasing distance, this amplitude tends 
for small angles to a finite (energy-dependent) complex limit, which we denote by 
a. The total scattering amplitude is therefore 


f=a+ti(b/@)o-v, b = 2Zepu/ch = 2Zap/e. (42.3) 


We see that the electromagnetic scattering is indeed predominant at sufficiently 
small angles. 

The expression (42.3) is the same in form as that discussed in QM, §140. We can 
therefore make direct use of the formulae derived there. The scattering cross- 
section summed over all possible final polarization states is 


2 
a = Ja? +2542 mane. (42.4) 


where € is the initial polarization of the neutron beam (called P in QM, $140). If the 
initial state is unpolarized (¢ = 0), then the polarization after scattering is 


2b0ima 


G = jal’o°+ b? Vv. (42.5) 


This is a maximum when 6 = b/|a|, and fiax = im a/lal. 


CHAPTER V 


RADIATION 


§43. The electromagnetic interaction operator 


THE interaction of electrons with an electromagnetic field can, as a rule, be treated 
by means of perturbation theory. This is because the electromagnetic interaction is 
comparatively weak, as is shown by the smallness of the corresponding dimension- 
less ‘“‘coupling constant”, viz. the fine-structure constant a = e’/fc = 1/137. The 
smallness of this number is of fundamental importance in quantum elec- 
trodynamics. 

In classical electrodynamics (see Fields, §28), the electromagnetic interaction is 
described by the term 


— ej"A, (43.1) 


in the Lagrangian density of the ‘‘field + charge” system (A being the 4-potential of 
the field and j the particle current density 4-vector). The current density satisfies 
the equation of continuity, 


a,j" =0, (43.2) 


which expresses the law of conservation of charge. According to Fields, §29, the 
gauge invariance of the theory is closely related to this law: when A, is replaced by 
A, + 4,x (4.1), a term —ej”d,x is added to the Lagrangian density (43.1), and this, by 
(43.2), may be written as the 4-divergence —ed,(xj"); it therefore disappears on 
integration over d*x in the action S = f L d‘x. 

In quantum electrodynamics, the 4-vectors j and A are replaced by the 
corresponding second-quantized operators. The current operator is expressed in 
terms of the w-operators by j = Wy. The generalized “coordinates” q in the 
Lagrangian 


| Ce d°x me tae [ GA) d°x 


are represented by the values of w, & and A at each point in space. Since the 
Lagrangian density is found to depend only on the ‘coordinates’? q themselves 
(and not on their derivatives with respect to x), the change to the Hamiltonian 
density by formula (10.11) amounts simply to a change in the sign of the Lagran- 
gian density.f Thus the electromagnetic interaction operator (the space integral of 


+t Independently of these arguments, it may be noted that, when only the first-order small correction 


is considered, any small correction in the Lagrangian appears in the Hamiltonian with just a change of 
sign (see Mechanics, §40). 
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the interaction Hamiltonian density) has the form 
V=e | (jA) d°x. (43.3) 
The free electromagnetic field operator is the sum 
A =D [érAn(x) + Cr AR], (43.4) 


which contains the operators of photon creation and annihilation in various states 
labelled by the suffix n. Each operator has matrix elements only for an increase or 
decrease of the corresponding occupation number N, by 1 (the other occupation 
numbers remaining unchanged). The operator A therefore also has matrix elements 
only for transitions in which the number of photons changes by 1. That is, only 
processes of the emission or absorption of a single photon occur in the first 
approximation of perturbation theory. 
According to (2.15), the matrix elements are 


(Na — 1en|Nn) = (Nn |ct]Nn — 1) = V Ny. (43.5) 
If there are no photons (of type n) in the initial state of the field, then (1|c;|0) = 1. 
The matrix element of the operator (43.3) for photon emission is 


Viilt) =e (jnA*) d°x, (43.6) 


where A,(x) is the wave function of the emitted photon and Jj; the matrix element 
of the operator j for a transition of the emitter from the initial state i to the final 
state f.t The 4-vector jf = (pyi, jj) is called the transition current. 

Similarly, we obtain the matrix element for photon absorption: 


V;i(t) = e | (jAn) 2x. (43.7) 


This differs from (43.6) only by having A,(x) in place of A*(x). 

The argument t of V;; is shown in order to emphasize that the matrix element is 
time-dependent. By separating the time factors in the wave functions, we can 
change in the usual way to time-independent matrix elements: 


Vi(t) = Va er For, (43.8) 


where E;, EF; are the initial and final energies of the emitting system, and the sign + 
is for emission and absorption respectively of a photon wo. 


+ The notation in (43.6) is slightly inconsistent. The suffixes in V;; refer to states of the whole system 
‘“‘emitter + field’, those in jp: to states of the emitter only. 
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The wave function of a photon with a definite momentum k and a definite 
polarization 1s 


= V (47) ame ens (43.9) 


(see (4.3); the time factor is omitted). Substituting in (43.6), we find the matrix 
element for the emission of such a photon: 


Vi = eV (477) e*jH(k), (43.10) 


1 
(2) 
where j(k) 1s the transition current in the momentum representation, 1.e. the 
Fourier component 


Ji (k) = [ ine ek" dey, (43.11) 
The corresponding formula for photon absorption is 


Vii = eV (477) e,j#(—k). (43.12) 


1 
(2m) 

The equation of conservation of current in the momentum representation is the 
condition of 4-transversality of the transition currents: 


ki} = wpyi(k) —k + ji(k) = 0. (43.13) 


The formulae given in this section do not assume any particular form of the 
current operator, and are generally valid for electromagnetic processes involving 
any charged particles. The existing theory allows the form of the current operator 
to be determined (and hence, in principle, its matrix elements to be calculated) only 
for electrons. For applications to systems of strongly interacting particles, includ- 
ing nuclei, a semi-phenomenological theory will be used, in which the transition 
currents appear as empirically determined quantities subject only to the conditions 
of space-time symmetry and to the equation of continuity. 


§44. Emission and absorption 


The transition probability under the action of a perturbation V is given, in the 
first approximation, by the well-known formulae of perturbation theory (QM, 842). 
Let the initial and final states of the emitting system belong to the discrete 
spectrum.t Then the probability (per unit time) of the transition i—f with emission 


+ This certainly implies that recoil is neglected, the emitter as a whole remaining at rest. 
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of a photon is 
dw = 2a| Vii) S(E; = E; a w) dy, (44.1) 


where dv arbitrarily denotes the ensemble of quantities describing the state of the 
photon and taking a continuous sequence of values; the photon wave function: is 
assumed normalized by the delta function ‘‘on the v scale”’. 

If a photon having a definite angular momentum is emitted, the only continuous 
variable is the frequency w. Integration of (44.1) with respect to dv = dw eliminates 
the delta function, w being replaced by E; — E;, and the transition probability is 


w = 2a| Vj)’. (44.2) 


If, however, we consider the emission of a photon having a given momentum k, 
then dv = d*k/(27)’ = w’ dw do/(27r)’. Here it is presupposed that the photon wave 
function (plane wave) is “‘normalized to one photon in the volume V = 1’, as 
always in this book; dv is the number of states in the phase volume V d’*k. Thus the 
probability of emission of a photon with a given momentum is 


dw = 22|V,)?8CE; — E; — w) d’k/(27)’, (44.3) 


or, after integration over dw, 
dw = = |V Pw? do. (44.4) 
Aa? fi ; 


In this we must substitute the matrix element V;; from (43.10). 

In subsequent sections we shall use these formulae to calculate the probability 
of emission in various specific cases. Here we shall consider certain general 
relations between radiative processes of various kinds. 

If in the initial state of the field there is already a non-zero number N,, of the 
photons in question, the matrix element for the transition is multiplied by 


(Nz + Ics|Nn) = VIN, + 1), (44.5) 


i.e. the transition probability 1s multiplied by N, +1. The 1 in this factor cor- 
responds to the spontaneous emission which occurs even if N, =0. The term N,, 
represents the stimulated or induced emission: we see that the presence of 
photons in the initial state of the field stimulates the further emission of photons of 
the sarne kind. 

The matrix element V, for the transition with the opposite change of state of 
the system (f —i) differs from Vj, in that (44.5) is replaced by 


(N, — 1en|Nn) = V Nn 


(and the other quantities are replaced by their complex conjugates). This opposite 
transition is a transition of the system from the level E; to the level E; with 
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absorption of a photon. Thus the photon emission and absorption probabilities for a 
given pair of states i, f are related byt 


WelWa =(N, + LIN, (44.6) 


an expression first derived by A. Einstein (1916). 
The number of photons can be related to the intensity of the external radiation 
incident on the system. Let 


Ixe dw do (44.7) 


be the radiation energy incident on unit area per unit time and having polarization 
e, frequency in the range dw and wave-vector direction in the solid-angle element 
do. These ranges correspond to k* dk do/(27r)’ field oscillators, each having Nye 
photons of the specified polarization. Hence the same energy (44.7) is given by the 
product 


k? dk do — hw? 
C “Ory N,.fw = 8a ce Ne dw do. 
From this we find the required relation: 
Nye = ons he. (44.8) 


Let dw?) be the probability of spontaneous emission of a photon with 
polarization e into the solid angle do, and let the indices (in) and (a) denote the 
corresponding probabilities for induced emission and for absorption. According to 
(44.6) and (44.8), these probabilities are related as follows: 


Siaec" 
hw 


dwi\) = dw” = dw?) - Tis: (44.9) 


If the incident radiation is isotropic and unpolarized (J,. independent of the 
directions of k and e), then the integration of (44.9) with respect to do and 
summation with respect to e gives similar relations between the total probabilities 
of radiative transitions (between given states i and f of the system): 


(in) _ 


D a0 
yw!) = yw Wis I, (44.10) 


where I = 2 X 471. is the total spectral intensity of the incident radiation. 

If the states i and f of the emitting (or absorbing) system are degenerate, the 
total probability of emission (or absorption) of the photons concerned is found by 
summation over all mutually degenerate final states and averaging over all possible 


+ In the rest of this section, ordinary units are used. 
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initial states. Let the degrees of degeneracy (statistical weights) of states i and f be 
g; and g;. For processes of spontaneous or induced emission, the states i are the 
initial states, and for absorption the states f. Assuming in each case that all g; or g; 
initial states are equally probable, we obviously have instead of (44.10) the relations 


agen 
gw = gi = giw'?) eI (44.11) 


In the literature one frequently meets the Einstein coefficients, defined as 
Ai = wi?) Bi = wT, By _ wc/T, (44.12) 


where I/c is the spatial spectral density of radiation energy. They are related by the 
equations 


2, By _ giBiy _ giAym’c?/he’. (44.13) 


$45. Dipole radiation 


Let us apply the formulae derived above to the emission of a photon by an 
electron (in general, a relativistic electron) moving in a given external field. In this 
case the transition current is the matrix element of the operator 


i = wy, 


in which the W-operators are assumed expanded in terms of the wave functions of 
stationary states of the electron in a given field (§32). The matrix element 
(0;1,|j|1,0;) corresponds to a transition of the electron from state i to state f. This 
change in the occupation numbers is brought about by the operator dj 4;, and the 
transition current is 


ik = Wy" ds = (btu, Ways), (45.1) 


where wy; and yy are the wave functions of the initial and final states of the electron. 

Let the wave function of the photon be chosen in the three-dimensionally 
transverse gauge (the polarization 4-vector e = (0,e)). Then the product j;e* = 
— jg e* in (43.10). Substituting V;; in (44.4), we obtain the following expression for 
the probability (per unit time) of emission of a photon with polarization e into the 
solid-angle element do: 


AWen = €7(w/277)|e* + 57k)!’ do, (45.2) 


where 


jn(k) = | Wray; > e*'" d°x. (45.3) 
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Summation with respect to the polarization of the photon is effected by 
averaging over the directions of e (in a plane perpendicular to the given direction 
n=k/q@), and the result is then doubled because of the two independent possible 
transverse polarizations of the photon.+ Thus the result is 


dw, = e'(w/277)\n X ji (kK) do. (45.4) 


A very important case is that where the photon wavelength A is large compared 
with the dimensions a of the radiating system. This usually means that the velocity 
of the particles is small compared with that of light. In the first approximation in 
a/X (corresponding to dipole radiation: cf. Fields, §67), the factor e “‘" varies only 
slightly in the region where jf, or wy is appreciably different from zero, and it can be 
replaced by unity in the transition current (45.3). This implies that the photon 
momentum ts neglected in comparison with the momenta of the particles in the 
system. 

In the same approximation, the integral j,,(0) may be replaced by tis non- 
relativistic value, which is simply the matrix element v;; of the electron velocity 
with respect to the Schrodinger wave functions. In turn, this element vy = — iwryi, 
and er; = d,,, where d is the dipole moment of the electron (in its orbital motion). 
Thus we have the following formula for the probability of dipole radiation: 


dWen = (w°/27)le* . d,i|’ do. (45.5) 


(Here the direction of n occurs implicitly: the vector e must be perpendicular to n.) 
Summation with respect to the polarizations gives 


dw, = (w*/27)|n Xx dil’ do. (45.6) 


Since these formulae are non-relativistic (as regards the electron), they can be 
immediately generalized to any electron system by taking d,; as the matrix element 
of the total dipole moment of the system. 

Integrating (45.6) over all directions, we have the total probability of radiation: 


Ww = (40°/3)|\dyil’, (45.7) 
or, in ordinary units, 


w = (4w°/3hc>)|dyi|. (45.7) 


+ In the averaging, we use the formula 


eet = (Sik — nink) (45.4a) 
or 


(a-e)(b- e*) = .fa-b—(a-n)(b- n)} 
=5faxn]-[b Xn], (45.4b) 


where a and b are constant vectors. 
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The intensity I is found by multiplying the probability by ha: 
I = (4*/3c’)|dyil’. (45.8) 


This is directly analogous to the classical formula (see Fields, (67.11)) for the 
intensity of dipole radiation from a system of periodically moving particles: the 
intensity of radiation at frequency w, = Sw (where w is the frequency of the 
particle motion and s an integer) is 


I, = (4w5/3c°)|d,|’, (45.9) 


where d, are the Fourier components of the dipole moment, i.e. the coefficients in 
the expansion 


d(t)= > de. (45.10) 


§=—00 


The quantum formula (45.8) is got from (45.9) by replacing these Fourier com- 
ponents by the matrix elements of the corresponding transitions. This rule (which is 
an expression of Bohr’s correspondence principle) is a particular case of a general 
relation between the Fourier components of classical quantities and the quantum 
matrix elements in the quasi-classical case (see QM, 848). The radiation is quasi- 
classical for transitions between states having large quantum numbers; the tran- 
sition energy hw = E; — E; 1s then smail in comparison with the energies E; and FE; 
of the radiator. This, however, would not lead to any change in the form of (45.8), 
which is valid for all transitions. This explains the fact (which is something of an 
accident) that the correspondence principle for the radiation intensity is valid not 
only in the quasi-classical but in the general quantum case. 


§ 46. Electric multipole radiation 


Instead of considering the emission of a photon in a given direction (i.e. with a 
given momentum), let us now consider the emission of a photon with definite 
values of the angular momentum j and its component m in some chosen direction z. 
We have seen in §6 that such photons can be of two kinds, electric and magnetic. 
Let us take first the emission of electric photons, and again assume that the 
dimensions of the radiating system are smail in comparison with the wavelength. 

The calculations are conveniently carried out by means of the photon wave 
functions in the momentum representation, i.e. by expressing the 4-vector A”(r) as 
a Fourier integral. Then the matrix element is 


Vi =e | iAte) ax 


3 
=e | d°x - j#(r) | om A*(k) e*"'; (46.1) 


for simplicity, we omit the suffixes wjm to the photon wave functions. 
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For an Ej photon we take the wave function from (7.10), with the arbitrary 
constant C having the value 


The reason for this choice is to ensure that, in the spatial components of the wave 
function (A), the terms containing spherical harmonics of order j — 1 cancel (as is 
seen from formulae (7.16)). Then A will include only spherical harmonics of order 
j +1, and therefore the corresponding contribution to Vj; is (as will be clear from 
the subsequent calculation) of a higher order of smallness (in a/A) than the 
contribution from the component A°=®, which includes spherical harmonics of 
the lower order Jj. 
Thus we put 


An = (0,0), =~ fLEESE, 5(k| ~ 0) Yntm) 


(n=k/w). Substituting this expression in (46.1) and carrying out the integration 
over d|k|, we obtain 


= Vi=—eit} ia} Veo lg x p(t) | doy e “ik FY (p), (46.2) 


To calculate the inner integral, we use the expansion (24.12), written in the form 


ered SD i'g,(kr) Y #,(k/k) Yim (rir), (46.3) 
where 
gi(kr) = = Ji(kr); (46.4) 
see QM, (34.33.7 
Substitetion of this expansion in (46.2) gives 


| e*'Y*(n) do, =4xi ietkn) YR(r/r); 


the remaining terms are zero because of the orthogonality of the spherical har- 
monics. On account of the condition a/A <1, only distances such that kr <1 will be 
important in the integral with respect to d*x. We can therefore replace the 


* The normalization of the functions g; is such that their asymptotic form as kr>«< is 


ei(kr) = C/kr) sin(kr — 3l77). (46.44) 
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functions g;(kr) by the first terms of their expansions in powers of kr: 


gi(kr) = (kr)/(2j + 1)!!. (46.5) 
The result is 
mete (2U+ DG +1 i* ? 
Va = (nti LD Dem (46.6) 


jm fi i | U jm 


(Y;,-m =(-1)""Y%,). The quantities (46.7) are called the 2/-pole electric transition 
moments of the system, by analogy with the corresponding classical quantities 
(Fields, 841).# 

For an electron in an external field, ps = w¥W;, and the quantities (46.7) are then 
calculated as the matrix elements of the classical quantity 


An 
(6) ' 22 ly. 
m Wop gf tie 


In the non-relativistic case (as regards the particle velocities), the transition 
moment can in principle be calculated similarly for any system of N interacting 
particles. The transition density is expressed in terms of the wave functions of the 
system by 


N 
pyi(r) = | w(ry,..-rn)ui(r,.--,0n) = S(r—r,) d°x,... d°xn, (46.8) 


where the integral is taken over the whole of configuration space.§ 

The photon wave function used here corresponds (in the coordinate represen- 
tation) to normalization by the delta function on the w scale, as assumed in formula 
(44,2). Substituting (46.6), we find the probability of Ej radiation:|| 


a 2(2] £ LQ a 1) w7*e*|(Q! ) iP (46 9) 
me ere Dy aiid | 

+ The power of kr is equal to the order of the function Yjm by which g; is multiplied. This justifies 
the neglect of the terms in A which contain higher-order spherical harmonics. 

<t The multipole moments are defined without the factor e, since in this book the currents also are 
defined without the charge factor. 

§ A situation can occur where the transition probability vanishes according to the approximate 
selection rules, valid only when the spin-orbit interaction of the electrons is neglected. Then, to obtain a 
non-zero result, we must uSe the wave functions with the relativistic correction which takes account of 
this interaction. 


|| It might appear at first sight that, owing to the isotropy of space, the total probability of photon 
emission ought not to depend on the value of m. The incorrectness of this conclusion is easily seen if we 
notice that different final states of the system (for a given initial state) correspond to the emission of 
photons with different values of m; cf. the rule (46.16) below. 
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In particular, for j = 1 we have 
4w? : 
win =z CQ mi? (46.10) 


The quantities Q‘? are related to the components of the electric dipole moment 
vector by 


eQW@=id., CQ ur = F y7q (de + id,), (46.11) 


Summing (46.10) with respect to m, we naturally obtain the earlier formula (45.7) 
for the total probability of dipole radiation. 

The angular distribution of multipole radiation is given by formula (7.11). When 
this is normalized to the total emission probability wWjn, we have 


dWim = |Y'O(n) | Wim do 


Sp ce ee 
Gay IWaYinl” do. (46.12) 


In particular, for } = 1, 


Yio=l 7 cos 6, Vi =F iy sin oe, 
Aa 87 


where 6 and ¢ are the polar angle and azimuth of the direction n relative to the 
z-axis. On calculating the gradient, we find that the angular distribution of dipole 
radiation with a definite value of m is given by 


Z 
AW 9 = wig sin’ 6 do, AW, +) = Wy, 41 aE do. (46.13) 


These expressions could also, of course, be obtained from formula (45.6) by putting 
firstly (for m=0) d,=d,=0, d,=d, secondly (for m=+1) d, =¥id, = d/V2, 
,= 0. 
If the order of magnitude of the dimensions of the system (atom or nucleus) is 
a, then that of the electric multipole moments is, in general, Q\)~ a’. The 
probability of multipole radiation is 


wl) ~ ak(ka)". (46.14) 


When the multipole order increases by one, the probability decreases by a factor 
~(ka). 

The laws of conservation of angular momentum and parity imply certain 
selection rules which restrict the possible changes in the state of the radiating 
system. If the initial angular momentum of the system is J;, then, after emission of 
a photon with angular momentum Jj, the angular momentum of the system can have 
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only those values J; which are in accordance with the angular momentum addition 
rule (J; — J; = J): 


J, -J,<j<Jp+ J. (46.15) 


For given values of J; and J;, the same rule (46.15) specifies the possible values 
of the photon angular momentum j. But, since the probability of emission decreases 
rapidly with increasing j, the emission occurs principally with the lowest possible 
multipole order. 

The components M; and M; of the angular momenta J; and J;, and m of the 
photon angular momentum, satisfy the relation 


M; re M,; = Ls (46.16) 


which is obvious from the same law of addition of angular momenta. 

The parities P; and P, of the initial and final states of the radiating system must 
be such that P;Pp, = P;, where P,, is the parity of the emitted photon. Since the 
parities can have only the values +1, this condition may also be written 


P:P; = Pon. (46.17) 


For an electric photon P,, = (—1)', and the parity selection rule for electric multi- 
pole radiation is therefore 


P,P, = (-1). (46.18) 


The selection rules for total angular momentum and for parity are entirely 
rigorous and must be satisfied in emission by any systems. There may also be 
other rules which are more restrictive and which arise from certain properties of 
the structure of particular radiating systems. These latter rules must of necessity be 
approximate to some extent; they will be discussed in later sections of this chapter. 

The dependence of the emission probability on the quantum numbers m, M; and 
M,; is entirely determined by the tensor character of the multipole moments., The 
quantities Q;, with a given j form a spherical tensor of rank j. The dependence of 
its matrix elements on these quantum numbers is given by the formula 


a ae a 
[np JM; | Q;, —m|niSiM;))? = Ge . : i) Kn, J,OiIniJi)? (46.19) 


(see QM, (107.6)), where n conventionally denotes all the quantum numbers 
specifying the state of the system, other than J and M. The reduced matrix 
elements on the right of (46.19) do not depend on m, Mj, M;. On substituting this 
formula in (46.9), we obtain the required dependence, which is proportional to 


& j Jj iF 
M; m — M; : 
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here it is, of course, assumed that the emitter is not in an external field, and that the 
transition frequency w is thus independent of M; and M,. 

Summing the probability over all values of M; (for a given M;), we have the 
total probability of emission of a photon of a given frequency from the initial level 
n;, J; of the system. It is obvious from the isotropy of space that this quantity must 
also be independent of the initial value M;. The summation is carried out by means 
of the formula 


1 
» KnyJ;My| Qj, -m| niJiM;)|? = 41 I(n,J;||Q\|nJi)|? (46.20) 


(see QM, (107.11)). 


§47. Magnetic multipole radiation 


The wave function of a magnetic photon is A” = (0, A), where A is given by (7.6). 
Substitution im (46.1) gives for the transition matrix element 


Vi = -e¥e 2 | d?x + ji(r) | do, e FY™*(p), (47.1) 


The components of the vector Y‘"? can be expressed in terms of the spherical 
harmonics of order j, as shown in (7.16). Again using the expansion (46.3), we 
obtain for the inner integral 


J eM TY *(n) dog = 4ari'gi(kr) Ym *(r/1), 


and, on substituting g, from (46.5),t 


Vi =— ei! 20", (r)ri « Y¥8™*(r/r) d?x 
fi (2] +1)! Vi jm ‘ 


Here we must substitute, in accordance with the definition (7.4), 


(m) 
Yim (r/r) = TEST Yims 
we then transform the integrand by means of the formula 
r yy “rx VY*, =—rx Vi : V(r'Y*,), 


obtaining 


(7+ D)G+1) oi? | 


Vii = (-1)" a O+pi e(Qe rn) fis (47.2) 


+ The current j must not be confused with the angular momentum J. 
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with the notation 
s 1 4 c . 
(Qhin’) fi = j+l Ve5] rx yi: VF Yim) d°x. (47.3) 


These are called the 2!-pole magnetic transition moments. 

Because of the analogy between the expressions (47.2) and (46.6) for the 
emission probability, we obtain a formula which differs from (46.10) only in that the 
electric moments are replaced by magnetic moments. Formula (46.12) for the 
angular distribution also remains valid (as has already been mentioned in con- 
nection with (7.11)). 

Let us analyse the form of (47.3) when j = 1. In this case, the functions are 


4 ; 4 _ 
\< VY jo =4Z, /2 rY, +, = = 75 (x + iy), 


and their gradients are simply the spherical unit vectors e®, e” (7.14). The 
quantities e(Q\)¢i are therefore the spherical components of the vector 


Py = se fe x Vii d*x, (47.4) 


which is similar in form to the classical magnetic moment (see Fields, §44). The 
total probability of M1 radiation is given in terms of this quantity by the formula 
(in ord'nary units) 

w= (4w°/3hc°)|pyl’. (47.5) 
We shall show how formula (47.4) is related to the usual non-relativistic quantum 


expression for the magnetic moment operator. 
The expression for the transition current is (see QM, §115) 


: l ‘i 
In =~ IV — VF) + a curl(¥8y;), (47.6) 
where p is the magnetic moment of the particle and s its spin. Hence 


py = - 7 | we(r x V)y; d*x + = | wi(r x Vp d°x tae | r X curl(ys*8y;) d°x. 
(47.7) 


In the second term, we write 
| Wir X Vt d°x = — { wr x V) yy d?x + curl(rys*y;) d°x. 


The last integral can be transformed into one over an infinitely distant surface, and 
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is zero. Thus the first two terms in (47.7) are equal. In the third term, we transform 
the integral as follows (temporarily writing F = w¥sv;): 


[ex (VXxB) ax = rx (EX F)~ | (FX V) Xr dx 


The surface integral is zero, and in the last term 


(FX V)xXr=—Fdivr+F = — 2F. 
Thus, 


[ ex curl F d°x =2| Fax 
The expression for p, therefore becomes 
ae eae a) 
My [ ox E+E s)yid'x, (47.8) 


where L = ~ir XV is the particle orbital angular momentum operator. This is, as it 
should be, the matrix element of the operator 


p=5—Lb+es, (47.9) 
which contains the operators of the orbital and intrinsic magnetic moments of the 
particle. 

The selection rules for magnetic multipole radiation are analogous to those for 
the electric case: the rules (46.15), (46.16) again apply to the total angular momen- 
tum, and the parity rule is 


P,P; = (-1)"", (47.10) 


which is obtained by substituting in (46.17) the parity of the Mj photon, P,, = (—1)'"". 


$48. Angular distribution and polarization of the radiation 


The formulae derived in §$46 and 47 relate to the emission of a photon with 
definite values of the angular momentum j and component thereof m. It was 
accordingly assumed that the radiating system (a nucleus, say) has not only definite 
values of the angular momentum J but also definite polarizations, i.e. values of M, 
both before and after the emission. 

Let us now consider the more general case of emission by a partially polarized 
nucleus (whose dimensions are again assumed small in comparison with the 
wavelength). The emitted photon again has a definite angular momentum Jj, but may 
be partially polarized. Let us find the emission probability as a function of the 


QE4-M 


174 Radiation §48 


direction n of the photon. This probability must be expressed in terms of density 
matrices which describe the polarization states of the nucleus and the photon. 

For this purpose, we shall first write down the emission probability as a 
function of the direction n and helicity A of the photon (A = +1), for the case 
where the initial and final nuclei have definite values Ji, Mi; J;, My. 

The matrix element for emission of a photon with definite values j,m is 
proportional to the matrix element of the (electric or magnetic) 2'-pole moment of 
the nucleus: 


(J>My 5 jm|V|JiMi) % (—1)"(J¢M;|Q;, -m|JiMi). (48.1) 


The wave function of the emitted photon (in the momentum representation) is 


m) 


proportional to Y{(n) or Y\"?(n). The wave function of a photon whose momentum 
is in the direction n and whose helicity is A is proportional to the polarization 
vector e’’. The matrix element for emission of a photon n, A is found by 
multiplying (48.1) by the projection of the wave function of the state |jm) on that of 
the state |nd): 


(J;M;; nd|V\JSiM;) & (— 1)" (JM; | Q; -m|JiMie* - Yim. 
According to (16.23), for photons of either type 
e*. Y,,.(n) « D® (n). (48.2) 
The matrix element of the multipole moment can be expressed in the usual way in 


terms of the reduced element. Thus we find the transition probability amplitude in 
the form 


(JyMy; ma |VJiMi) % (—1ymerm(_ : Jy )epim, (483) 


where Q denotes (J;||Q||Jj). 

We can now proceed to the general case of mixed polarization states. According 
to the general rules of quantum mechanics, the transition probability is proportional 
to the expressiont 

>, (JMy; mA|V|JiMi)J;M f; m0'|V [JM 1)* x 
(m) 
x (Mi|p°|M iM jp? |MyA'|p™A), (48.4) 


1 If the initial and final states of the system are described by the superpositions 


YO = ane, B= S) Dann, 


n 


then the matrix element is 


(f[ Vi) = S D&anVinn, 
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where p"’, p”, p™ are the density matrices of the initial nucleus, the final nucleus 
and the emitted photon; the symbol (m) beneath the summation sign indicates that 
the sum is taken over all the m-type quantities which occur twice (Mi, M;, M;, Mj, 
r,A'). Then (48.3) is to be substituted in (48.4). 

Let w(n) do denote the probability of emission of a photon into the solid angle 
do. The total probability of emission, in any direction and with any polarizations of 
the photon and the final nucleus, is evidently independent of the initial polarization 
state of the nucleus, is given by formulae already known, and is of no interest here. 
We shall therefore arbitrarily normalize the probability w(n) to unity. The result isTf 


wane (2) + ue + 1) > (<1) MMO DO*, x 


—~M,; -—m Mi/\-M; —m' M; 
x (M;|p|M})(M jp? |M;)(A'|p A); 


it will be seen below that the normalization is correct. This formula can be 
transformed by using the series expansion QM (110.2) for the product of the two D 
functions: 


DYDVE = (-1)"*" DED, -n 


—(—4pim @ j ae j ) (L) 
(—1) » 2L+1{, ge alla at Sq D®, 


where A=A—A’', »w =m-—mi' and L takes integral values =2j. Thus we have finally 


w(n = CLE yet ae 1) Mi “Mitmsl(9T +1) x 


“b j ae j an Jy] Il jy J ye 
h -d' —A]\m —m’ ~p/\-M, —m M,/\—M; —m' M! 


D¥.0n)(Mi|p®|M ')(M jp |M;)(A'|p 1A). me) 


and its square is 


KALVIDP = Vann Vin dnd ¥Dm'b ¥. 


n,n',m,m' 
The case of mixed states is obtained by making the changes 


Anat > pur’, Dm'bé, > pens 


so that 


fl Vil? > > VinnV Xn'p pom. 


n,n’, m,m' 


+ In transforming the sign factor note that the numbers 2J;, 2J;, 2M;, 2M; have the same parity; J and 
m are integers, and A =+1. 


176 Radiation §48 


As previously, 2im,) denotes summation over all m-type quantities which occur 
twice. Here it must be noted that A and A’ differ from the other quantities, since 
they have only two values, A, A’=+ 1, corresponding to the two polarizations of 
the photon, and not 2] + 1 values for any given J. 

Formula (48.5) embodies all the necessary information about the angular dis- 
tribution and polarization of the emitted photons, and also about the polarization of 
the secondary nuclei (i.e. those which have emitted a photon). It is assumed that 
the initial density matrix is given. 


ANGULAR DISTRIBUTION 


The angular distribution of the photons is obtained by summation over all 
polarizations of the photon and the secondary nucleus. The averaging with respect 
to polarizations is done by substituting the density matrices of the unpolarized 
States: 


(Alp™IA =F, (M;y|p®|M) = (48.6) 


_li is 
py Pens Wea 


after which the summation amounts to a multiplication by 2 for the photon or by 
2J;+1 for the nucleus. Thus the summation is effected by simply making the 
changes 


(lpA Bi, (My P1M9) > Sayan, (48.7) 
and the angular distribution is 


=AADEATYS SH" "OL + DDR) x 


LL (m) 


(} J oe j = Jp 1)( Jj j aye 
A —-d 0/\m —m' —p/\—-M,; —m M,/\—M; —m’' M; 


x (Mi|p|M?). 


w(n) 


This formula can be considerably simplified by carrying out the summation over 
m-type quantities. First of all, we note that 


Jog 0 6OLY\_yytf do] a 
« ~ 0) =! I) ee A O07? (48.8) 
and therefore 


joj 0)= ( j 5) 
Pa er 2\) “1 0 for even L, 


=() for odd L. 


In the sum over L, therefore, only the terms with even L remain, and it involves 
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only even-order spherical harmonics D{?. This result is obvious a priori, since, by 
the conservation of parity, the probability must be unchanged by inversion, 1.e. by 
putting n> ~—n. 

Thus we have 


a(n) = Et DES + D ( j a L) 
w(n) ra 2X 2L+ 1); “1 9 Poem) x 
cy a ee © )( Jp J 1) Jj J i) 
x 2 ( ” c —m' —p/\—M; —m M;/\—M; —m' M; . 
x (Mi|p|M’). 
The normalization here is easily verified: with the formula 


| D§2(n) do/4a = Or0 5.05 


integration over all directions leaves only the term with L=0, » =0, and the 
formulae 


ye om a eee 


J; J A) l (i) 
2, (iy, an Me) Bee Pe 


then show that the integral is equal to unity. 

The further summation with respect to m, m', M; in the inner sum in w(n) is 
effected by means of QM, (108.4). The final expression for the photon angular 
distribution is 


y(n) uae (—1) its (2] + IV (2; a 1) x 


4a 

ai joj oe Ji A De PL) 

« > Ci VeL+n(4 a 5) | jf gf PurDia), 
(48.9) 

where 

) GL | ayyemil Fe 2 Si Nag rags 

Pi = EV (QL + DAH+ DIS DM ag yg, (Mile IM, 
(48.10) 


a iia Gand 0 MEA a ere 


The inner sum in (48.9) is taken over all |u| <L, and the outer sum over all even L 
such that 


LS2j, LsS2J,. (48.11) 
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(These conditions result from the triangle rule which has to be satisfied by the 
quantities in the 3j-symbols that appear in (48.9), (48.10).) The number of terms in 
the sum is therefore usually small. For instance, when J; = 0 or 3, only the term with 
L = 0 remains, and the radiation is isotropic; this term is easily seen to equal 4, as it 
should by the normalization condition. When J; = 1 or 3/2, or j = 1, the two terms 
with L =0 or 2 remain in the sum over L. If the density matrix p“ is diagonal 
(M; = M}), then px = 0, and the distribution function (48.9) becomes an expansion in 
Legendre polynomials; according to (16.5) and QM, (58.23), the functions D§} are 
the functions P;, (cos @). Finally, if 


1 
TO), es ee ee 


i.e. if the initial nucleus is unpolarized, then all the PX, are zero except Pi} = 1.4 

The quantities P,, are convenient characteristics of the polarization state of the 
nucleus, and will be called polarization moments. Formula (48.10) defines them in 
terms of the density matrix pyy:. The inverse formula expressing the density matrix 
in terms of the polarization moments is easily verified: 


7 7s 1m ot £ 


Let f1, be a spherical tensor depending on the polarization state of the nucleus. 
According to the general rules (see QM, (14.8)), its mean value in a state having the 
density matrix py 1S 


fu = os pum<JM'|f.,.|JM). (48.13) 


Expressing the matrix elements of the f,, in terms of the reduced element (J||f,||J) 
by means of the formula 


IM 'funlIM) = iD (yey OIF. 


+ Using the result that 


& 0 recy 1 ies 
-~M’' 0 M/~ V (25 +1) MM” 


we have 


ee ee oe J oLJvys og 
Se ome VOIDS (te ane 0 a) 


= V(2I of 1) S10 540, 


and the conclusion stated then follows from the definition (48.10). 
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and using the polarization moments as defined by (48.10), we obtain 
a7 (Jf) 
fin = VI@L + NQI+ D] 2 8 


PHOTON POLARIZATION 


When the matrices p™ and p”, as well as p, are specified, formula (48.5) 
determines the probability of a transition in which a photon is emitted, and the 
nucleus left, in definite polarization states. Such states are essentially characteristic 
not of the emission process as such, but of the detectors which record the photon 
and the recoil nucleus and distinguish definite polarizations of these. There is 
another and more natural formulation of the problem, in which the final state of the 
“nucleus + photon” system is not specified from the start, and the polarization 
density matrix of this state is to be determined, with only the direction of the 
photon emission fixed. 

The answer to this problem is given by the same formula (48.5). If this is written 
as 


w = w(n) >, (My; nA|p|Mj; nX')(A'|p™|A)(M jp M5), (48.15) 


then the expression (M;; nA|p|M;; nd’) is the required density matrix, since accord- 
ing to the general rules of quantum mechanics the probability w of a transition to a 
specified state is given by its ‘“‘projection” on the given p™, p“. The factor w(n) is 
written in (48.15) so that this matrix shall be normalized by the usual condition, 


>, (Mj; nd|p|M;; nA) = 1. 
X, My 
If we want the polarization of the photon alone, a summation over M; = M; is 


necessary: 


(nd|p|nr') = > (My; nA|p|M;; nX’). 


My 
Using a derivation exactly similar to that of (48.9), we obtain 


(27+ 1)V(2J, +1) . 


Wea fe ay ees 


x> -)'VEL+D( ae ne : Ap PO *DE (Cn), (48.16) 


where A =A—A’, and the summation is over all integral values of L which satisfy 


the conditions (48.11). 
In particular, circular polarization is determined by the Stokes parameter 


2 = (n1|p|n1)— (n, —1|p|n, —1); 
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see §8, Problem. Because of the relation (48.8), all terms with even L in this 
difference are zero, and the resulting formula for &, differs from (48.9) only in that 
the summation is over odd instead of even values of L. 


SECONDARY NUCLEUS POLARIZATION 


Finally, if we are interested only in the final polarization of the nuclei, we must 
put p” > 6. If the integration with respect to directions of the photon is also carried 
out, the density matrix of the secondary nucleus is 


(Mjlp|M}) = | w(n)(Myn|p|M jn) do 


= Qt 1). 2. Ely x 
m, Mi, M;: 


J _ ui ji On VE 
«(a -m M, (_ —m ui )(Mile |M'). 


The polarization moments calculated by means of this matrix are 


PY, = (DH QT, + (25; + Dy ee ||P e. (48.17) 


Jp Jp j 

If the initial nucleus is unpolarized, so is the final nucleus, but there exists a 
correlation polarization, i.e. a polarization of the nucleus after emission in a 
specified direction. Putting p“ > 6/(2J; + 1) (and correspondingly w(n) = 1/47) and 
calculating as in the derivation of (48.9), we obtain for the density matrix describing 
this polarization 


; ', so fe _4)\5+Mpt1 ] ] rll Jj L “ 
(MsnleIMjsm)=Qi+ D-n¥"" YS @L+o(, 1) og) ow, oo oM)* 


Jp Jp LY nw 
a 1 f Dw. (48. 18) 


The corresponding polarization moments are 
PY = it(-1)'"402] + DVIQL + DQI; + D1 x 


A, AG 4 ow. am 


Only even-order moments occur (which is also a consequence of the conservation 
of parity already mentioned). 

If the secondary nucleus in turn emits a photon, it will generate an anisotropic 
distribution, being polarized. Since the polarization moments (48.19) depend on the 
direction n of the photon emitted in the first decay, there is a certain correlation 
between the directions of successively emitted photons (with an unpolarized 
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primary nucleus). Other correlation effects (of polarization, etc). in cascade emis- 
sion can be treated similarly. 


PROBLEM 


Find the relation between the polarization moments ¥,, Po, and the mean values of the angular 
momentum vector J and the quadrupole moment tensor Qik. 
SOLUTION. The reduced elements of the vector J and the tensor Qx are determined from 


J = (IY /2T +b, 
Qik = JQP /2I + 1); 


cf. QM, (107.10), (107.11). The operator Qi is expressed in terms of the angular momentum operators as 
in QM (75.2): 


, 30 35 457225 
Qik = 2723 ~1) (SJk + Jedi — 33° 8x). 
Hence we find the mean value 


—- 3Q va a q27f3 + DOS +3) 
Qik = spas 4 = | OF —N |. 


The reduced matrix elements are 
JJ) = VII + DAI + DI, 


327 + 1)\J + DQT +307 
ols» = [22 ee 


From (48.14) we now see that the polarization moments ?), are equal to the spherical components of 


the vector 
3 a 
Via iy” 


and the moments #2, are equal to the spherical components of the tensor 


10J(2F — 1) ec 
30+ DQI+3)! Q° 


§49. Radiation from atoms: the electric type 


The energies of the outer electrons of an atom (which take part in optical 
radiative transitions) have, as a rough estimate, the order of magnitude E ~ me‘/h’, 
so that the radiated wavelengths \ ~ hc/E ~ h’/ame’. The dimension of the atom is 
a ~ h*/me’. Thus, in the optical spectra of atoms, we generally have the inequality 
al\ ~ a <1. The ratio v/c ~ a, where v is the velocity of the optical electrons, has 
a similar order of magnitude. 

Thus, in the optical spectra of atoms, a condition is satisfied which means that 


+ A detailed account of these problems is given in the paper by A. Z. Dolginov, in: L. A. Sliv, ed., 
Gamma-luchi (Gamma Rays), USSR Academy of Sciences, Moscow 1961, pp. 523-681. 
t In §§49-51 and 53-55, ordinary units are used. 
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the probability of electric dipole radiation (if this is allowed by the selection rules) 
considerably exceeds the probabilities of multipole transitions.t For this reason it 
is electric dipole transitions which are the most important in atomic spectroscopy. 
As has already been mentioned, such transitions are subject to strict selection 
rules as regards the total angular momentum J of the atom and the parity P:# 


\J’-J|<1sJ4+J, (49.1) 
PP’=-—1. (49.2) 


The inequality |J’—J|<1 signifies that the angular momentum J can change only 
by 0 or +1; also, the transition 00 is forbidden by the inequality J + J'21. The 
parities of the initial and final states must be opposite.§ 

The probability of emission by the transition nJM —>n’J’M' is determined by 
the corresponding matrix element of the dipole moment of the atom: 


4w? 
3hc? 


w(nJM > n'J'M') = <3 [(n'J'M'|d_n|nJM)|’, (49.3) 


wo =w(nJ on’'s'). 


On summing (49.3) over all values of M'= M—~m (with M given), we obtain the 
total probability of emission with a given frequency from the atomic level n, J. The 
summation is carried out by means of (46.20), and the result is 


3 
w(nd > n'T') = S25 * nS dl|nJ) (49.4) 


3hc* 25 + 


The squared modulus of the reduced matrix element is sometimes called the 
transition line strength; it is symmetrical as between the initial and final states. 

The observed radiation intensity is found by multiplying w by hw and by the 
number N,,; of atoms in the source which are at the excitation level concerned. For 
example, in a gas at temperature T this number is N,) « (2J + 1) exp(— E,,/T); the 
factor 2J + 1 1s the statistical weight of the level with angular momentum J. 

Further deductions regarding transition probabilities in atomic spectra can be 
obtained only for specific kinds of atomic states. We shall not here discuss methods 
of calculating matrix elements where the degree of approximation has no clear 
theoretical significance, but simply derive some relations valid for a fairly large 
class of states (especially in light atoms) of the LS coupling type (see QM, $872). 
Such states are described not only by the total angular momentum but also by 
definite values of the orbital angular momentum L and the spin S, which in this 
case are conserved. 


+t Typical values of the dipole transition probability in the optical region of the spectra of atoms are of 
the order of 10° sec '. 

t We shall now denote the quantum numbers of the initial and final states by unprimed and primed 
letters respectively. The letters n,n’ will denote all the quantum numbers which define the state of the 
system, other than those shown explicitly. 

§ The parity selection rule was first established by O. Laporte (1924). 
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Since the dipole moment is a purely orbital quantity, its operator commutes 
with the spin operator, i.e. its matrix is diagonal with respect to the number S. For 
the number L, the dipole moment is subject to the same selection rules as any 
orbital vector (see QM, $29). Thus transitions between LS-type states are subject 
to the following selection rules (in addition to (49.1), (49.2)): 


S'—S=0, (49.5) 
IL'- LI S15 L+L’. (49.6) 


It should again be stressed that these rules are approximate, and no longer apply 
when the spin-orbit interaction is taken into account. 

The rule (49.5), which forbids transitions between terms of different multi- 
plicity, is valid not only for electric dipole transitions but for all electric transitions: 
the electric multipole moments of all orders are orbital tensors, and therefore their 
matrices are diagonal with respect to spin. For instance, for electric quadrupole 
transitions, in addition to the general rules 


y= JS 2a eS, PP =, (49.7) 
in the case of LS coupling we have the further rules 
S'-S =0, I=L) S26 04-7". (49.8) 


The emission probability can be written in explicit form as a function of the 
numbers S, J, J’. This is done immediately by means of the matrix elements of 
spherical tensors in the addition of angular momenta. According to QM, (109.3), we 
havet 
| Or ae 


Kin’ L'SJ'\\d||InL SJ)? = (27 + 1)(2J'+ D| eo 


PV iin'Lifd|mLyp. (49.9) 


Substitution of this in (49.4) gives 


J' 
L 


3 2 
w(nLSJ > n'L'SJ') = $2, (20+ Dy; "| Kn'Li\d|nL)P, (49.10) 


with w = w(nLS —n'L'S).£ 
A sum rule can be derived for these probabilities. The squares of the 6j]-symbols 
satisfy the summation formula (see QM, (108.7)) 


i ae 28ye.. - A 
Ses + fy 7 - oF (49.11) 


+ The “angular momenta of sub-systems 1 and 2” in the formulae in QM, §109, are here to be taken 
as the orbital angular momentum and spin of the atom, whose interaction is neglected; the quantities fia 
are represented by the orbital vector dq. 

+ In neglecting the spin-orbit interaction in the calculation of the matrix elements, we also neglect 
the dependence of the frequencies on J and J’, i.e. the fine structure of the initial and final levels of the 
atom. 
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Using this, we obtain from (49.10) 


iy ‘\y 4w° I yr 2 
» w(nLSJ > n'L'SJ') = she DLA (n'L'|\d||nL)/’. (49.12) 


This quantity is thus found to be independent of the initial value of J. 

For radiation from a gas whose temperature is much greater than the fine- 
structure intervals in the atomic term nSL, the states with different J are uniformly 
occupied, i.e. all values of J are equally probable. The probability that the atom is 
at a level with some definite value of J is then 


2J +1 
QL+ QS +1) es 
1.e. is equal to the ratio of the statistical weight of the level to the total statistical 
weight of the term nSL. Averaging the expressions (49.10) or their sums (49.12) 
with respect to these probabilities is equivalent to multiplying by the factor (49.13). 
This averaging will be denoted by a bar over the letter w. The total probability of 
emission of all the lines in a spectral multiplet (formed by all possible transitions 
between the fine-structure components of the two terms nSL and n'SL’) is the sum 


w(nLS > n'L'S) = > > w(nLSJ > n'L'SJ’). (49.14) 
Jos’ 
Since, of course, 


> 23 +1) =(28+ )2L +1), 
J 


the result obtained for the total probability agrees with (49.12). Thus the relative 
probability (which is the same thing as the relative intensity) of a single line is 

w(nLSJ >n'L'SJ') _ (25 + 12’ +1) {(F J’ sy (49.15) 

w(nLS > n'L'S) 2S +1 JI L jo 

The analysis of the numerical values given by this formula shows that the 

strongest lines in the multiplet are those for which AJ = AL (called main lines, 

while the remaining components of the multiplet are called satellites). The intensity 

of the main lines increases with the initial value of J. 
Summation of the quantities (49.15) with respect to J’ and J gives respectively 


inate a 
w(nLS — n'L'S) (2L + 1)2S + 1) 
(49.16) 


wnLS>n'L'S) (2L+D)2QS+1) 
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Thus the total intensity of all the lines in a spectral multiplet having a common 
initial or final level is proportional to the statistical weight of that common level. 

We may also consider the hyperfine structure of atomic spectral lines. The 
hyperfine splitting of atomic levels is due to the interaction of the electrons with 
the spin of the nucleus if the latter is non-zero (see QM, §122). The total angular 
momentum F of the atom (including the nucleus) consists of the total electron 
angular momentum J and the angular momentum I of the nucleus. Each component 
of the hyperfine structure of the level n,J has a different value of the quantum 
number F. 

The rigorous law of conservation of angular momentum now leads to a rigorous 
selection rule for the total angular momentum F: for electric dipole radiation, 


|IF’- F|<1<F+F'. (49.17) 


But, in view of the extreme weakness of the interaction of the electrons with the 
spin of the nucleus, this interaction may be neglected in calculating the matrix 
elements of the electric (and magnetic) moments of the electron shell of the atom. 
Thus the previous selection rules regarding the electron angular momentum J and 
the electron parity remain valid also. In particular, the latter selection rule prohibits 
electric dipole transitions between hyperfine structure components of the same 
term: all these levels have the same parity, whereas such transitions can occur only 
between states of different parity. 

Since the dipole moment operator commutes with the nuclear spin, the depen- 
dence of the matrix elements on the numbers I and F can be found explicitly, the 
calculations differing cnly by an obvious change of notation from those given 
above for LS coupling. The probability of emission, summed over the final values 
of the component of the total angular momentum F, is 


4w° 1 


Se spy Kn IF ld |inJIF YF, 


w(nJIF > n'J'IF') = 
(49.18) 


w = o(nJ -n'J'), 
and the square of the reduced matrix element is 


ye ok 


2 
F J 1} Kn’ S'dlndyf (49.19) 


Kn'J'IE'\\d|\nJIF) |? = (2F + 1)(2F' + » 


PROBLEM 


The majority of the lines in the spectra of the alkali metals can be described as resulting from 
transitions of a single outer (optical) electron in the self-consistent field of the rest of the atom, which 
forms a configuration of closed shells; the state of the atom is governed by LS coupling. Under these 
conditions, determine the relative intensities of the fine structure components of the spectral lines. 


SOLUTION. The total angular momenta L and S =3 of the atom are equal to the orbital angular 
momentum and spin of the optical electron. The parity of the state is therefore (—1)” (the parity of the 
closed configuration of the rest of the atom being positive). The parity selection rules therefore forbid 
the dipole transition with L’ = L, and so only transitions with L’- L = +1 are possible. The transitions 
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between components of the doublet levels n,L and n’,L—1 give only three lines, because of the 
selection rule for J (Fig. 1). Their relative intensities (denoted by a, b, c) are most simply determined 
from the rules (49.16), instead of using (49.15) directly. The ratios of total intensities of lines having each 
initial (or final) level give two equations 


b+c  2L a+b 2L 
a es? cc 2L-2’ 


whence 
a:b:c =(L+)DQL—1):1:(L-DQL+4 1). 


If L = 1, the lower level is unsplit, line c does not appear and a/b =2. 


§50. Radiation from atoms: the magnetic type 


The magnetic moment of an atom is equal, in order of magnitude, to the Bohr 
magneton: p ~ eh/mc. This differs by a factor a from the order of magnitude of the 
electric dipole moment, d ~ ea ~ h*/me (since v/c ~ a, we have pz ~ do/c, as is to be 
expected). Hence it follows that the probability of magnetic dipole (M 1) radiation 
from the atom is about a’ times less than that of electric dipole radiation at the 
same frequency. The magnetic radiation is therefore important in practice only for 
transitions forbidden by the selection rules for the electric case. 

The ratio of the probability of electric quadrupole (E2) radiation to that of M1 
radiation is, in order of magnitude, 


Oe A 


F2 Pi OE fee Bee Dae AE\? 
ee) ven 


the quadrupole moment ~ea’, E ~ h?/ma’ is the energy of the atom, and AE the 
change in energy in the transition. We see that, for medium atomic frequencies (i.e. 
when AE ~ E£), the probabilities of E2 and M1 radiation are of the same order of 
magnitude (assuming, of course, that both are allowed by the selection rules). If, 
however, AE <E (as for transitions between fine structure components of the 
same term), then M1 radiation is more probable than E72. 

The magnetic dipole transitions are subject to the rigorous selection rules 


IJ’ -JI|<1<J4J', (50.2) 
PP’ =1. (50.3) 


§50 Radiation from Atoms: The Magnetic Type 187 


For LS coupling, there are additional selection rules, which are even more 
restrictive than in the electric case. This is because of a particular property of the 
magnetic moment of the atom, which arises from the fact that all the particles 
(electrons) in the system are identical: the magnetic moment operator of the atom 
can be expressed in terms of the total orbital and spin angular momentum 
operators: 


fi = wo (L + 28) = — po(S +S), (50.4) 


where pto=|el|h/2mc is the Bohr magneton (see QM, $113). Owing to the con- 
servation of total angular momentum, the operator J has only matrix elements 
which are diagonal with respect to the energy, and in considering radiative tran- 
sitions it is therefore sufficient to put fa = — poS.7 

The angular momenta L and S are separately conserved when the spin-orbit 
interaction is neglected. The spin operator is therefore diagonal in all the quantum 
numbers n, S, L which belong to the unsplit term. In order for a transition to occur 
at all, the number J must change. The selection rules are consequently 


n'=n, S'=S, L'=L, peLe et, (50.5) 


i.e. transitions are possible only between fine structure components of a single 
term. 

The emission probability can be calculated exactly in this case. By an ap- 
propriate change of notation in formula (49.10), we obtain 


S. J 2 


fo HO Oy 4 D7 5 ¥ (S|]S|]S)/7. 


w(nLSJ > nLSJ') = 


The reduced spin matrix element with respect to the spin eigenfunctions is 
(S||S||S) = V[S(S + DS + 1)]; (50.6) 
see QM, (29.13). The 6j-symbol is 


{‘ J-1 it (L+S4+J+1)(L+S—-J+4+1)(L-S+J\(S - Pos) 
1 


J S SQ2S + DS +2)2QJ —1)2I QJ + 1) 
(50.7) 
see QM, §108, Table 10. The result is then 
J+1 
w(nLSJ > nLs, J —1)= sy — 7 WMLs, J—-—1--nLSJ) 
@ 46 a 
= FiclQ + Dd DJ (L+S+J+1)(L+S-J+1)x 
~~ eS = D)G+ Ls). (50.8) 


+ An exception occurs in cases where the electronic angular momentum J of the atom is not 
conserved: when the hyperfine structure is taken into account, when an external field is present, and so 
on (see the Problems). 
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Transitions between hyperfine structure components of one level (whose 
frequencies are in the radio wave range) cannot occur as electric dipole transitions, 
since all the components have the same parity. E2 and M1 transitions involve no 
change of parity. But, owing to the relatively very small intervals in the hyperfine 
structure, E2 radiation has a low probability compared with M1 (cf. (50.1)), so that 
these transitions occur as magnetic dipole transitions. 


PROBLEMS 


PROBLEM 1. Find the probability of an M1 transition between hyperfine structure components of a 
single level. 


SOLUTION. The transition probability is given by formulae (49.18), (49.19), in which the diagonal 
reduced matrix element (nJ||u||nJ) of the magnetic moment will now appear. Its value can be written 
down immediately by noting that the total (not reduced) matrix element (nJM|.|nJM) determines the 
splitting of the relevant level by the Zeeman effect (see QM, §113), and is —ogM, where g is the Landé 
factor. The reduced matrix element is (see QM, (29.7)) 


(nJ lind) = 55 VU + 1) + 1)KnJMs|nJM) 
= — pog V[J(J + 12S + VD). 
The required probability is thus found to bet 


w(nJIF > nJI, F - 1) = i 


+ W(nJT, F - 1 > nJIF) 
w Log” 7 : 7 
“COR +DES TI+F+DG+I-F+ DF +I -IKF-J+D. 


This expression differs from (50.8) only by an obvious change of notation and the extra factor g*. 


PROBLEM 2. Find the probability of an M1 transition between Zeeman components of a single 
atomic level. 


SOLUTION. This is a transition M—>M —1 with the values of n and J unchanged; the transition 
frequency is (see (51.3) below) hw = wogH, where g is the Landé factor. The matrix element of the 
spherical component yp-1 of the vector p is 


J-M+1)\(J+M 
Kd, M = tps = fF tft) 


= — pog V[J —M + 1)(J + M)] 


(see QM, (27.12), and Problem 1). The transition probability is 


3 
wes . KnJ, M — 1|u-i]nJM)| 


= 2uaH (J-M+1)\(J+M). 


+ An interesting example is the transition between the hyperfine structure components of the ground 
level (1s1) of the hydrogen atom, where both E1 and E2 transitions are strictly forbidden, the latter by 
the rule which prohibits a quadrupole transition with J+J’'= I. This transition has a frequency 
w = 2m X 1.42 x 10’ sec ' (wavelength \ = 21cm). Putting g =2, I =3, J =3, F =1, F’ =0, we obtain 


W = 40 pn i3hc* = 2.85 x 10°" 
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§51. Radiation from atoms: the Zeeman and Stark effects 


In an external magnetic field H (assumed weak), each atomic level with total 
angular momentum J 1s split into 2J + 1 levels, 


Ey = E® + wogMH, (51.1) 


where E™ is the unperturbed level, wo the Bohr magneton, g the Landé factor, and 
M the component of J in the direction of the field (see QM, §113). Thus the 
degeneracy with respect to directions of the angular momentum is entirely 
removed. 

Spectral lines resulting from transitions between two split levels are cor- 
respondingly split. The number of components of the line is determined by the 
selection rule for the number M, according to which, for dipole radiation, we must 
have 


m= M-—M'=0,+1. (51.2) 


In addition to this rule, the transitions with M = M'=0 are forbidden if also 
J'=J. This is seen immediately from the general expressions (QM, (29.7)) for the 
matrix elements of an arbitrary vector. 

The components resulting from transitions with m =0 and m = +1 are called 7 
and o components respectively. Their frequencies are 


ho, = ho + poH(g — 2g’)M, 


(51.3) 
ho, = hw + poH[gM — g'(M +1)}. 
In the particular case where g = g’, we have 
ho, = hw, hw, = ho = pwogH, (51.4) 


whatever the value of M; thus, in this case, the line is split into a triplet with an 
undisplaced 7 component and two o components lying symmetrically on either 
side of it (called the ‘“‘normal’’ Zeeman effect). 

The total probability (for all directions) of emission of radiation is proportional 
to the squared modulus |(n’J’M’|d_,,|nJM)|’. Hence, using formula (46.19) with 
j=1, we see that the relative probability of emission of each of the Zeeman 
components of the spectral line is 


’ 2 
ei at ois 


In the particular case of the ‘normal’? Zeeman effect there are only three 
components, each arising from transitions with all initial values of M for given m. 
Since 


x (en "3 Or®) 


M, M’ m 
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(see QM, (106.12)), the emission of all three components is equally probable in this 
case. 

The relative intensity of the Zeeman components when observed in a particular 
direction (relative to the direction of the magnetic field applied to the source) is of 
greater interest, however. According to (45.5), the probability of emission (and 
therefore the line strength) in a given direction n is proportional to = |e* - d,|’, 
where the summation is over the two independent polarizations e which are 
possible for a given n. 

For observation along the field (along the z-axis), this sum is 


(de gil? + \Cdy gil’, 


or, in spherical components, 
dpi? + [(d_1) gil’. 


This means that only the two o components (m=+1) are observed in the 
longitudinal direction (along the field). Their intensities are proportional to 


ypoo4sfsy 
ei +] my of) 


These lines have definite values of the component m of the angular momentum in 
the direction of propagation, and either right-hand (m = 1) or left-hand (m = — 1) 
circular polarization (see §8). 

For observation in a direction perpendicular to the field (along the x-axis, say), 
the intensity is proportional to the sum 


(dz )si|? + |Cdy gi]? = [(do) ji? + I(d1)ji > + |(d_-1) 4:17}. 


Thus two o components and a m component are observed in the transverse 
direction, with respective intensities proportional to 


A ee Ge yyaegsy 
ee +] av ang (uu 0 a ron 


the intensities of the o components are half as great as in the case of longitudinal 
observation. The wm component is linearly polarized along the z-axis; the o 
components, as observed in this direction, are linearly polarized along the y-axis. 

The relative intensities of the Zeeman components are seen to be entirely 
determined by the initial and final values of J and M, and not by any other 
properties of the levels. 

The selection rules forbid electric dipole transitions between Zeeman com- 
ponents of the same level, since all these have the same parity. Such transitions 
occur as magnetic dipole transitions, for the same reason as was mentioned at the 
end of $50 in respect of transitions between hyperfine structure components of a 
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level. Because of the selection rule for the number M, the transitions occur only 
between adjacent components (M’— M = +1).t 

The splitting of atomic levels in a weak electric field (the Stark effect), unlike 
that in a magnetic field, does not cause complete removal of the degeneracy with 
respect to directions of the angular momentum. All the levels except those with 
M =0 remain doubly degenerate, each corresponding to two states with angular 
Momentum components M and —M. 

The calculating of the relative intensities of the Stark components of a spectral 
line is exactly similar to that given above for the Zeeman effect.t It must be remem- 
bered that the intensity of the m components includes contributions from the 
transitions M—M and -M—>-—M (when M#0), and that of the o components 
includes contributions from the transitions M>M+1 and -M—>-(M +1). 
Hence, for example, in transverse observation the intensities of the 7 components 


are proportional to 
2(J PY ae ) 
M 0 —-M?’ 


and those of the o components are proportional to the sums 


3( Us 1 J ) +3 J 1 a) J’ 1 J ) 
2\M+1 +1 -M eve +1 M/) \M+1 ¥1 —-M 


(when all the numbers in the second row change sign, the 3j-symbols can at most 
change sign, so that their squares are unaltered). 

In an external field, even if it is weak, the total angular momentum J is no 
longer strictly conserved; in a uniform field, only the angular momentum component 
M is exactly conserved. Thus, in radiative transitions in a weak field, the con- 
servation of angular momentum need not be rigorously maintained, and the atomic 
spectra may contain lines which are forbidden by the usual selection rules. 

The calculation of the intensities of these lines is equivalent to the calculation 
of the corrections in the dipole moment matrix, which in turn requires the 
determination of corrections to the wave functions of stationary states. In the first 
approximation of perturbation theory (with respect to the weak external field), the 
wave function includes “admixtures” of states which are connected to the initial 
state by non-zero matrix elements of the perturbation (—E- d in the electric field): 
the admixture of a state yw in a state yy, is 


—E- d>, 
E,— E, *” 


+ The frequencies of these transitions are usually in the range corresponding to centimetre 
wavelengths, and are observed in absorption and induced emission (electron paramagnetic resonance): 
the absorbing atoms are in a strong constant magnetic field (which causes the Zeeman splitting) and a 
weak radio-frequency field of the resonance frequency. 

+ This refers to the quadratic Stark effect, which occurs in all atoms except hydrogen (see QM, §76). 
The field is assumed so weak that the level splitting which it causes is small even in comparison with the 
fine structure intervals. 
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Thus the matrix element of the ‘“‘forbidden”’ transition contains a term 


= (E i d>;)d3. 
E, - FE, 


which is not zero if transitions from the ‘‘intermediate’’ state 2 to the initial state 1 
and final state 3 are. allowed. 


§52. Radiation from atoms: the hydrogen atom 


The hydrogen atom is the only one for which the transition matrix elements can 
be completely calculated in an analytical form (W. Gordon, 1929). 

The parity of a state of the hydrogen atom is (—1)', i.e. is uniquely determined 
by the orbital angular momentum of the electron (the number [, which defines the 
parity of the state, retains its significance for the exact relativistic wave functions, 
i.e. When the spin—orbit interaction is taken into account). The parity selection rule 
therefore strictly forbids electric dipole transitions without change of 1; only 
transitions with |>1+1 are possible. There is, however, no restriction on the 
change in the principal quantum number n. 

The dipole moment of the hydrogen atom is equivalent to the position vector of 
the electron: d= er. Since the electron wave function in the hydrogen atom is the 
product of an angular part and the radial function R,,, the reduced matrix elements 
of the position vector can also be written as the product 


(n', L— 1 }rl|nl) = (1 — 1p]1) | Ry. -irRar? dr, 
0 


where (I —1||v||l) are the reduced matrix elements of the unit vector v in the 
direction of r. These are 


(1 = Ifo ||) = Ulo|ll — 1)* = iV, 
see QM, (29.14). Thus 


(n', 1 Alfrl|nl) = — (nllfr||n', | — 1) = iV | Roache de. (52.1) 
0 


The non-relativistic radial functions of the discrete spectrum of the hydrogen 
atom are given in QM, (36.13):f 


! 
2 et nt eR nt 141,2142,2rfn). (52.2) 


Ru = "Oleh! Vin -1- 1)! 


im In the present section, atomic units are used. In ordinary units, the expressions given below for the 
matrix elements of the coordinate are multiplied by h’/me’ (or by h?/mZe’ for a hydrogen-like ion with 
atomic number Z). 
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The integral in (52.1), containing the product of two confluent hypergeometric 
functions, is calculated by means of the formulae in QM, 8f.t The result is 


Jp ED [nt Din’ + LD)! Gan y(n = yen 
Y* 4Q1-1)! Va =1= Din’)! (n+n'y 


. {F(- nt+i+1,—n'+1 2, —4nn'(n—n)— 


(n’, L— 1 \r\|nl) = x 


2 
= n—n nas ean eal _— , =! gh 
(2 —) Fens =1,=0 ol Sania) )f (52.3) 
where the F(a, B, y, z) are hypergeometric functions. Since the parameters a and B 
in this case are negative integers or zero, these functions reduce to polynomials.+ 

For reference, the following are the expressions obtained from (52.3) in some 
particular cases (the values of | being indicated by the spectroscopic symbols 
S,p,d,...): 
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Formula (52.3) is not applicable to transitions with no change in the principal 
quantum number n (transitions between the fine structure components of a level). 
In this case (n = n’), the integration is carried out by expressing the radial functions 
in terms of generalized Laguerre polynomials: 


A Geapen@)ume as 


In the integral 


co 


[ Ra aRur? dr « | ep eT (p)List-1(p) dp, 
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we replace one of the polynomials by its expression in terms of a generating function 
(see QM, §d): 


L2 __(n+D)! andy Spel 
Lisi (p )= (n —{-1)! e’p dp (e p ). 


+ In the notation used there, we have to calculate the integral J3i:.(—n +1+1,—n'+1). This is done 
by means of formulae (f.12)-(f.16). 

+ Numerical tabulations of the matrix elements and transition probabilities for hydrogen are given 
by H. A. Bethe and E. E. Salpeter, Handbuch der Physik 35, 88-436, Springer, Berlin, 1957. 
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After n —!—1 integrations by parts, we obtain an integral of the form 


ee) 


fero(t) (pLisi-1(p)) dp, 


0 


in which we replace the Laguerre polynomial by its explicit form: 


rm es n-m n (—p)* 
Lre)=(-Nrat & (  S 


After the differentiation in the sum, only three terms remain, and the integration is 
then elementary. The result is simply 


(n, | — I|rllnl) = iV 1 -3nV(n?—P). (52.6) 


The integral 


oO 


| Ry -iRuar’ dr a Xn'1-10Xn1) dr, 
0 


0 
where yn = rR, is the coefficient in the expansion of the function ry, in terms of 
the orthogonal functions y,)-; (n'=1,2,...). The sum of the squared moduli of 


these coefficients is equal to the integral of the square of the expanded function.t 
Hence 


S Ken’, 1 Ap = 1 fr Py? d (52.7) 


Using the known expression for the mean square of r in the state nl (see QM, 
(36.16)), we obtain the sum rule 


> Kn’, = 1frlinl)? = 2In7[Sn? + 1 — 31(1 + 1). (52.8) 


For given n,l and large n’, the matrix element of the transition nl —n/'!' 
decreases according to 


Kin'l'\r\|n1)?? x 3/n?, (52.9) 
as can be seen both from the particular expressions (52.4) and from the general 
formula (52.3). This result is to be expected: the Coulomb levels of energy 


E'=-—1/2n” have an almost continuous distribution when n’ is large, and the 


+ The summation is over states of both the discrete spectrum and the continuous spectrum. 
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probability of a transition to a level in the interval dE’ is proportional to the density 
of these levels, which in turn is proportional to n’°. 

The Stark effect in hydrogen has the peculiarity that the splitting is proportional 
to the first power of the electric field (QM, §77). Here the field is assumed to be 
both weak enough for perturbation theory to be applicable and such that the level 
splitting is large compared with the fine structure of the levels. Under these 
conditions, the magnitude of the angular momentum is not conserved, and the 
levels have to be classified by the parabolic quantum numbers n,, n2, m. The last of 
these, the magnetic quantum number m, again determines the component of the 
orbital angular momentum along the z-axis (the direction of the field), which is 
conserved under the conditions stated (neglecting the spin-orbit interaction). It is 
therefore governed by the ordinary selection rule 


m’—-m=0, +1. (52.10) 


There is no restriction on the changes in n, and n>. 

The matrix elements of the dipole moment in parabolic coordinates can also be 
calculated analytically. The resulting formulae, however, are very lengthy, and will 
not be given here.t 


PROBLEMS 


PROBLEM 1. Find the Stark splitting of hydrogen levels when it is small compared with the fine 
structure intervals (but large compared with the Lamb shift). 


SOLUTION. Under the conditions stated, there remains a twofold degeneracy of the unperturbed 
levels with |! =j+2, and the Stark splitting is therefore again linear in the field. The splitting A is 
determined from the secular equation 


-A  ~E(d:)ro|_ 


= * 
“pay cape Bee eee, 


the suffixes 1 and 2 correspond to states with | = j +5 and a given magnetic quantum number m; the 
perturbation V = — Ed, is diagonal in m and has no elements diagonal in |. The matrix element of the 
orbital quantity d, is calculated by means of formulae (29.7) and (109.3) in QM: 


; : = m ee . 
. : re (a ee 
Gi.t— Malay = 07+ Df! 1 Ft ha mann, 


where we must put | =j +2; the quantity (I! — 1||d||l) is taken from (52.6), and the result is 


nm E 


ee eee 


PROBLEM 2. Determine the probability of photon emission in the transition 2si> 1s} of the 
hydrogen atom (G. Breit and E. Teller, 1940). 


SOLUTION. The process is strictly forbidden by parity for an E1 transition, and by the rule (46.15) 
for an E2 transition. We have therefore to calculate the probability of an M1 transition, given by (47.5). 


+ These formulae and the corresponding numerical tabulations are to be found in the work by Bethe 
and Salpeter cited above. 
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In the present case (1 =0), however, the magnetic moment is a purely spin quantity, and its matrix 
element is zero if the spin—orbit interaction is neglected, because of the mutual orthogonality of the 
orbital wave functions with different principal quantum numbers. This means that, to obtain a result 


other than zero, the Pauli’s equation approximation is insufficient, and we must start from the complete 
Dirac’s equation. 


In the standard representation of the wave functions, the transition current ist 


in = Wfawi = bfayxit+ xfogi. 


According to (35.1), (24.2) and (24.8), the wave functions of the states with | = 0, j =} are 


ia © 7 77 (_ cee) 


where n = r/r, and w(m) is areal three-dimensional unit spinor corresponding to the spin projection value m. 
Thus 


: I 
in = 75 thrsiwyo(o « n)wi — gefiwy(o - mows}. 
Substituting this in (47.4) and carrying out the integration over the directions of n, we find 
ui = —(el6i)wyo X owl = —iewsowil 


(from the Pauli matrix commutation rules, o X o = 2ia); here 


pe | (frei + fg;)r? ar. (1) 
0 


The photon emission probability (47.5), summed over the values of my, is 
w = (4e?w/27) wie wil’ = 4e7>w°I’/9. (2) 


From (35.4) we have, with x = — 1. 


f' f' ( =) R’ 
= i _ +— amy 
g e+mt+a/r 2m as r/4m”’ 


in the second term, the exact function f is replaced by the non-relativistic radial function R. With the 
approximation g = R'/2m, the integral 


oo 


ttle ie es roe ee 
r= 5 | (RR dr = 5 | RRe dr = 0, (3) 
0 


since Ry and R; are orthogonal. In the next approximation, using (3), we find 
1 ; 1 ; a ' 
T= z | (ffi) rdr+ eR | {R;Ri(ei — ef) os (R;Ri) }r dr. (4) 
0 
Since, from the orthogonality of the exact functions yi and yy, when Ki = Ky, 


| (fify + gigs)r’ dr =0, 
0 


+ In this Problem, relativistic units are used. 
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the first term in (4) may be written, after integration by parts, as 


ioe) 


hf stv? dem | ser? dem ba f RII? 
am | sae dr= 5 | gigi dr ane R;Rir’ dr. 
0 0 0 


A calculation of the integral, with the functions 
Ry =2(mayre”"”, Ri = (1/'V2)(ma)3(1 — 3marye2"" 
(see QM, §36) and the energy difference 
w = ei — ef = 3ma*(1 — 1/2’) = gma’, 
gives I = 2°7a7/9m. Hence the transition probability is (in ordinary units) 


5 532 3 2 ii 
w= Pre =a =56X10°sec |. 


The corresponding lifetime of the 2s: state is very long, and in practice it is much more likely that 
de-excitation will occur by the simultaneous emission of two photons; see the next-to-last footnote to 
§59. 


§ 53. Radiation from diatomic molecules: electronic spectra 


The specific features of molecular spectra are mainly due to the partition of the 
energy of the molecule into electronic, vibrational and rotational parts, each of the 
latter two being small compared with the previous one. The level structure of 
diatomic molecules has been described in detail in QM, Chapter XI. Here we shall 
consider the resulting pattern of the spectrum and the calculation of line strengths.t 

Let us take first the general case, in which the electronic state of the molecule 
(and therefore also, in general, the vibrational and rotational states) changes in the 
transition. The frequencies of such transitions lie in the visible and ultra-violet 
regions of the spectrum. They are spoken of collectively as the electronic spectrum 
of the molecule. We shall always be considering electric dipole transitions; those of 
other types are of little importance in molecular spectroscopy. 

As with dipole transitions in any system, the following selection rule applies to 
the total angular momentum J of the molecule: 


IJ’-J|<1<J 4". (53.1) 


In the present case, the strict selection rule regarding the parity of the system 
corresponds to a selection rule regarding the sign of the level. (In the customary 
terminology of molecular spectroscopy, states having wave functions which do or 
do not change sign on inversion, i.e. when the coordinates of the electrons and the 
nuclei change sign, are called negative and positive states respectively.) Thus we 
have the rigorous rule 


+3-, —->+. (53.2) 


+ The discussion below is based on QM, §8§78 and 82-88. For brevity, we shall not make constant 
reference to those sections. 
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If the molecule consists of identical atoms (with nuclei of the same isotope), the 
levels can be classified with respect to interchange of the coordinates of the nuclei: 
symmetric (s) levels, with wave functions which do not change sign under this 
transformation, and antisymmetric (a) levels, with functions which do change sign. 
Since the electron dipole moment operator is unaffected by this transformation, its 
matrix elements are non-zero only for transitions without change of this sym- 
metry:t 


s>S, a> a. (53.3) 


This rule is not absolutely rigorous, however, since the existence of a given 
symmetry property of a level depends on a certain definite value of the total spin I 
of the nuclei in the molecule. Owing to the extreme weakness of the interaction 
between the nuclear spins and the electrons, the spin I is very nearly conserved, 
but not exactly. When this interaction is taken into account, I does not have a 
definite value, the symmetry property (s or a) is not conserved, and the selection 
rule (53.3) no longer applies. 

The electron terms of a molecule consisting of identical atoms are also described 
by their parity (g or uw), 1.e. the behaviour of the wave functions when the electron 
coordinates (measured from the centre of the molecule) change sign while the 
coordinates of the nuclei remain unchanged. This property is closely related to the 
nuclear symmetry and the sign of the rotational levels belonging to this term. The 
levels which belong to an even (g) electron term can be s+ or a-—, and those 
belonging to an odd (u) term can be s — or a +. The rules (53.2) and (53.3) therefore 
give the further rule 


g>u, ug. (53.4) 


The rule (53.4) remains approximately valid for molecules consisting of different 
isotopes of the same element. Since the nuclear charges are equal, we can consider 
the electron term with fixed nuclei, and thus have a system of electrons in an 
electric field which possesses a centre of symmetry at the midpoint of the line 
joining the nuclei. The symmetry of the electron wave function with respect to 
inversion in this point determines the parity of the term, and since the electric 
dipole moment vector changes sign under this transformation, we arrive at (53.4). 
The rule as derived in this way is only approximate, because the nuclei have been 
regarded as fixed, and it therefore ceases to be valid when the interaction between 
the electron state and the rotation of the molecule is taken into account. 

Further selection rules depend on specific assumptions concerning the relative 
magnitude of the different interactions in the molecule (i.e. the type of coupling), and 
therefore can only be approximate. 

The majority of the electron terms in diatomic molecules belong to coupling 
type a or b. Both these have the property that the coupling of the orbital angular 
momentum with the axis (the electric interaction between the two atoms in the 
molecule) is large compared with all other interactions. The quantum numbers A 


+ This rule is clearly valid for transitions of any multipole order. 
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and S therefore exist, these being respectively the component of the orbital angular 
momentum of the electrons along the axis of the molecule and the total spin of the 
electrons. The operator of an orbital quantity, the electron orbital angular momen- 
tum, commutes with the spin operator, so that 


S’'-S=0 (cases a and b). (53.5) 
The change in A must satisfy the selection rule 
A'—-A=0,+1 (cases a and b), (53.6) 
and for transitions between states with A = 0 (& terms) there is a further rule 
~t>d*, Y—>X (cases a and b). (53.7) 


(The states %* and > differ as regards behaviour under reflection in a plane 
through the axis of the molecule.) The rules (53.6), (53.7) are obtained by consider- 
ing the molecule in a system of coordinates fixed to the nuclei (see QM, §87); the 
rule (53.6) is analogous to the selection rule for the magnetic quantum number in 
atoms. 

The coupling types a and b differ as regards the relation between the spin—axis 
interaction energy and the rotation energy (the intervals between rotational levels). 
In case a the former is greater, in case b it is much smaller. We shall now examine 
these cases separately. 

Case a. Here the quantum number > exists, which is the component of the total 
spin along the axis of the molecule (and therefore so does the number 0 = 3+ A, 
the component of the total angular momentum). If both the initial state and the final 
state belong to case a, then we have the rule 


>'-X=0 (case a), (53.8) 


which follows from the fact, already mentioned, that the dipole moment commutes 
with the spin. From (53.6) and (53.8) it follows thatT 


Q'-— Q = 0, +1. (53.9) 


If Q=Q'=0, then in addition to the general rule (53.1) the transitions with J' = 
are forbidden:+ 


J’-J =+1 when QX=)'=0 (case a). (53.10) 


+ This rule remains valid in case c also (where the coupling of the orbital angular momentum with 
the axis is small compared with the spin-orbit coupling) although the numbers A and > do not separately 
exist. 

+ This rule is analogous to the prohibition of atomic transitions with J = J' when M = M'= 0 (see 
851), but that rule was of possible interest only in the presence of an external field. Here the rule follows 


immediately from formula (53.12) below; the 3j-symbol @ 0 a is zero if J'=J, since the sum 
J'+J+1 is odd. 
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Let us consider transitions between any two specified vibrational levels belong- 
ing to two different electron terms (of type a). When the fine structure of the 
electron term is taken into account, each of these levels splits into several 
components, the number of which, 2S + 1, must be the same for both, according to 
the rule (53.5). According to the rule (53.8), each component of one level combines 
with only one component of the other level, having the same value of &. 

Let us next take a pair of levels with the same %; the values of 0 and ’ can 
differ (like A and A’) by 0 or +1. When rotation is taken into account, each level 
splits into a series of levels with different values of J and J’ in the ranges J = |Q|, 
J'2=|Q'|. The dependence of the transition probabilities on these numbers can be 
derived in a general form (H. Honl and F. London, 1925). 

The matrix element of the transition nAQJM, > n'A'QO'J'M 5 (where n denotes the 
characteristics of the electron term other than 2 and A) ts 


Kn'A'O/S'M4jdq|n AQJM;)| = 


_ re eee ee a ae ee 
=VIQT+DQT+D Sg or alu qm, )KnA'ldelnay, — (53.1) 


where d, and d, are respectively the spherical components of the dipole moment 
vector in the fixed coordinate system xyz and in the “moving” system én¢ with the 
C-axis along the axis of the molecule. This formula is derived by means of QM, 
(110.6). The matrix elements (n’A’|d,|n A) are independent of the rotational quan- 
tum numbers J, J', and depend only on the characteristics of the electron terms 
(and in this case are also independentt of the number %; the numbers (’ = A’+ > 
and 2 = A+% are therefore omitted in the notation for the matrix element. 

The probability of the transition nAQJ —n'A'Q'J' is proportional to the square 
of the matrix element (53.11) after summation over Mj. Using the formula QM 
(106.12): 


a # ! At 
M} — M3 q M; 2J+V 


we obtain 


Je I J 


w(MADI >n'N'0T)=2S'+D( “og ag 9 


2 
) B(n’,n: A’, A), (53.12) 


where the coefficients B are independent of J and J' (we are, of course, neglecting 
the relatively very small difference in the frequencies of transitions with different J 
and J').£ 


+ This can be shown in the same way as was done for the scalar f in QM, beginning of §29. In the 
present case the operator of the vector quantity d commutes with that of the vector S, which is 
conserved (in the zero-order approximation), and = is the component of S along the ¢-axis in the rotating 
coordinate system in which the condition of commutability of d and S has to be considered. 

+ Each of the rotational levels J considered splits into two when A-doubling is taken into account; 
one of the two is positive and the other negative. Thus, instead of one transition J >J', we have, using 
the selection rule (53.2), two transitions: from the positive and negative components of the level J to the 
negative and positive components, respectively, of the level J'. The probabilities of these transitions are 
equal. 
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If we sum (53.12) with respect to J’, then (because of the orthogonality of the 
3j-symbols, QM (106.13)) the result is simply B(n’,n; A’, A). Thus the total 
probability of transitions from a rotational level J of the state © to all levels J’ of 
the state ()' is independent of J. 

Case b. Here the quantum number K exists, which is the angular momentum of 
the molecule without regard to its spin, as well as the total angular momentum J. 
The selection rules for K are the same as the general selection rules for any orbital 
vector quantity (such as the electric dipole moment): 


|K'-K|<1<K+K' (case b), (53.13) 


together with the prohibition of a transition with K = K’' when A= A'=0 (cor- 
responding to (53.10)): 


K'—-K=+1 when A=A’'=0. (53.14) 


Let us consider transitions between rotational components of specified vibra- 
tional levels of two electron states belonging to type b. The probabilities of such 
transitions are given by the same formula (53.12), with K and A instead of J and . 
When the fine structure is taken into account (for S40), each rotational level K 
splits into 2S+1 components with J =|K—S|,...,K +S, and so a multiplet 
appears in place of the single line J > J’. Since in this case we have addition of the 
angular momenta K and S, which are free (i.e. not coupled to the axis of the 
molecule), the formulae for the relative transition probabilities for the various lines 
in the multiplet are the same as the corresponding formulae (49.15) for the fine 
structure components of atomic spectra, where the corresponding angular momenta 
(in the case of LS coupling) are L and S. 

Thus we have examined the selection rules governing the possible spectral lines 
in all the fundamental cases that can occur in diatomic molecules. 

The group of lines arising from transitions between rotational components of 
two given electronic-vibrational levels forms what is called in spectroscopy a band; 
the lines in a band are very close together, because the rotational intervals are 
small. The frequencies of these lines are given by the differences 


hw); = constant + BJ(J +1)— B'J'(J' + 1), (53.15) 


where B and B’ are the rotational constants in the two electronic states; in order to 
avoid unnecessary complications, the electron terms are assumed to be singlets. For 
J'=J,J +1, formula (53.15) is represented graphically (Fig. 2) by three parabolic 
branches, whose points for integral J give the values of the frequencies. (The 
arrangement of the branches in Fig. 2 corresponds to the case B’ < B. If B’ > B, their 
open ends are towards small values of w, and the branch with J'=J—1 is the 
highest.)t The existence of a branch which passes through an apex is seen from the 
diagram to cause the lines to become increasingly dense towards a certain limiting 
position (the head of the band). 


+ The series of lines corresponding to transitions with J'=J+1,J,J—1 are called the P, Q and R 
branches respectively. 
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In connection with line strengths, mention should be made of the curious effect of 
alternating intensities in certain bands of the electronic spectra of molecules 
consisting of atoms of the same isotope (W. Heisenberg and F. Hund, 1927). The 
symmetry conditions pertaining to the nuclear spins have the result that, in the 
electron > terms, the rotational components with even and odd K have opposite 
symmetry with respect to the nuclei, and therefore different nuclear statistical weights 
g, and g, (see QM, $86). According to the rule (53.14), only J’= J +1 is allowed in 
transitions between the two different } terms, and according to the rule (53.4), one of 
the & terms must be even and the other odd. The result is that, for a given value of 
J'—J, transitions with successive values of J take place alternately between pairs of 
symmetric levels and pairs of antisymmetric levels, as shown in Fig. 3 for the example 
of the states &; and %7,. The observed line strength is proportional to the number of 
molecules in the initial state concerned, and therefore to its statistical weight. Thus the 
intensities of successive lines (J = 0, 1, 2,...) willbe alternately greater and less, being 
alternately proportional to g, and g, (this behaviour being superimposed on the 
monotonic variation given by formula (53.12)).T 

There are no exact selection rules concerning the change in the vibrational 
quantum number in transitions between two different electron terms. There is, 
however, a rule (Franck and Condon’s principle) whereby the most probable 
change in the vibrational state may be predicted. It is based on the fact that the 


+ Here we assume that all the states having different values of the total nuclear spin are uniformly 
occupied. 
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U(r) 


U'(r) E 


Fic. 4. 


motion of the nuclei is quasi-classical, because of their large mass (cf. the discussion 
of pre-dissociation in QM, §90).t 

In the integral which determines the matrix element of the transition between 
vibrational states E and E’ of electron terms U(r) and U(r), the most important 
range is the neighbourhood of the point r = ro where 


U(n)- U'n) =E-E', (53.16) 


i.e. the momenta of the relative motion of the nuclei in the two states are the same, 
p =p’. For a given value of E, the transition probability as a function of the final 
energy E’ increases as each of the differences E— U and E’— U' decreases, and is 
a maximum when 


E— U(n) = E'— U(r) = 9, (53.17) 


i.e. when the “transition point” rp (the root of equation (53.16)) coincides with the 
classical turning point of the nuclei. (Fig. 4 illustrates graphically this relationship 
between E and the most probable E’.) This can be intuitively expressed by saying 
that the transition is most probable near the turning point of the nuclei, where they 
spend a relatively large amount of time. 


§ 54. Radiation from diatomic molecules: vibrational and 
rotational spectra 


The selection rules and formulae for transition probabilities given in §53 remain 
valid for transitions in which the electronic state of the molecule is unchanged,+ 
Here we shall discuss only some particular features of these transitions. 

First of all, the selection rule (53.4) prohibits all (dipole) transitions without 
change of electronic state in molecules consisting of like atoms, since in such a 
transition the parity of the electron term would remain unaltered. It follows from 
the discussion in §53 that such transitions can be allowed only when the interaction 
between the nuclear spins and the electrons is considered or, for molecules of 


+ Strictly speaking, it is also necessary that the vibrational quantum number should be sufficiently 
large. 

+ Transitions in which the vibrational (and therefore the rotational) state changes form what is called 
the vibrational spectrum of the molecule; this lies in the near infra-red (wavelengths <20 ym). 
Transitions in which only the rotational state changes form the rotational spectrum in the far infra-red 
(wavelengths >20 pm). 
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different isotopes of the same element, because of the effect of the rotation on the 
electronic state. 

The calculation of the dipole moment matrix elements is reduced (by the 
formulae in QM, 887) to a calculation in a coordinate system rotating with the 
molecule. The wave function of the molecule in these coordinates is the product of 
the wave function of the electrons for a given distance r between the nuclei and 
the wave function of the vibrational motion of the nuclei in the effective field U(r) 
of the electrons and the nuclei. When the influence of the motion of the nuclei on 
the electronic state is entirely neglected, the initial and final electron wave func- 
tions for the transitions in question are the same. The integration over the electron 
coordinates therefore gives, in the matrix element, simply the mean dipole moment 
d of the molecule (which is obviously along its axis) as a function of the distance r. 
Owing to the smallness of the vibrations, the function d(r) can be expanded in 
powers of the vibrational coordinate q=r-—r. For transitions which involve a 
change in the vibrational state, the zero-order term in the expansion does not occur 
in the matrix element, because the wave functions are orthogonal for vibrational 
motion in the same field U(q), and this leaves the term which tis proportional to q. 
If the vibrations are regarded as harmonic, it follows from the known properties of 
the linear oscillator (QM, §23) that the matrix elements are zero except for 
transitions between adjacent vibrational states; thus, for the vibrational quantum 
number v we have the selection rule 


v'—-v=41. (54.1) 


This rule is not valid, however, when the vibrations are not harmonic and the 

subsequent terms in the expansion of the function d(q) are taken into account. 
For a purely rotational transition (with no change in the vibrational state also), 

the matrix element of the dipole moment component along the moving C-axis can 


simply be equated to the mean dipole moment of the molecule, d = d(0).+ The 
probability of the transition J —> J —1 is then 


4w? a P- OY 
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(54.2) 


and from this formula we can calculate not only the relative probabilities (as in 
(53.12)) but also the absolute probabilities. Formula (54.2) is for case a; in case b, J 
and () are to be replaced by K and A. 

The frequencies of the purely rotational transitions are given by the differences 
of the rotational energies BJ(J + 1): 


hwy, 3-1 a 2BJ. (54.3) 


Successive lines are at equal distances 2B. 


+ It is obvious from symmetry that d = 0 in a molecule consisting of like atoms. 
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§55. Radiation from nuclei 


For y radiation from nuclei it is usually true that the dimensions of the system 
(the radius R of the nucleus) are small compared with the wavelength of the 
photon. But the intervals between the nuclear levels, and therefore the energy of 
the y quantum, are generally small compared with the energy per nucleon in the 
nucleus. The quantity R/A is thus not directly related to the velocity v/c of the 
nucleons in the nucleus, and is in general considerably less than v/c. Accordingly, 
the probability of MI radiation is usually greater than that of E, |+ 1 radiation (cf. 
the beginning of §50). 

The general selection rules for the total angular momentum (the “spin’’) of the 
nucleus and for the parity are the same as for radiation from any system. The 
characteristic feature of nuclear radiation is that transitions of high multipole order 
commonly occur. Unlike atoms, whose radiation is usually of the electric dipole 
type, nuclei undergo such transitions comparatively rarely at low energies, on 
account of the selection rules. 

If a radiative transition of a nucleus can be regarded as a single-particle 
transition (a change of state of one nucleon while the state of the rest of the 
nucleus remains unchanged), then there are additional selection rules regarding the 
angular momentum of that nucleon, but in practice such “‘single-particle”’ selection 
rules are found to be only approximately obeyed. 

The selection rules for the isotopic spin are peculiar to nuclei. The component 
T3 of the isotopic spin is determined by the atomic weight and atomic number of 
the nucleus: 


T3=(Z -N)=Z—3A. 


When the value of T3 is specified, the absolute magnitude of the isotopic spin can 
take any value T =|T;|. The selection rule for the number T in radiative transitions 
arises because the electric and magnetic moment operators of the nucleus, ex- 
pressed in terms of the isotopic spin operators of the nucleons, are the sums of a scalar 
and the x;-component of a vector in isotopic space; see QM, §116. Their matrix 
elements are therefore zero unless 


T’— T =0, +1. (55.1) 


This rule itself, however, imposes no special restrictions on transitions in light nuclei 
(the only ones for which the isotopic spin can be said with reasonable accuracy to be 
conserved); the low levels of these nuclei in fact include none with T > 1. 

For El transitions, however, there is a further rule which arises because there 
is no isotopic-scalar part for the electric dipole moment, and the operator of this 
moment is simply the x3-component of the isotopic vector (see QM, 8116). Hence, 
if T; = 0, transitions with AT = 0 are also forbidden, and so, in nuclei having equal 
numbers of neutrons and protons (N = Z, A=2Z), E1 transitions are possible only 
if 


T'-T=+1 (T;=0). (55.2) 


QE4-0 
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The accuracy with which this rule is obeyed depends, of course, on the exactness 
of conservation of the isotopic spin of the nucleus. 

The probability of E1 transitions in the nucleus is influenced also by the recoil 
of the rest of the nucleus when a particular nucleon moves. The result is that the 
protons contribute to the dipole moment with an effective charge e(1 — Z/A) instead 
of e, and the neutrons with a charge —eZ/A instead of zero (see QM, 8118). The 
decreased effective charge of the proton causes some reduction in the probability 
of E1 transitions. 

The energy levels of non-spherical nuclei have a rotational structure, and 
therefore such nuclei show a characteristic rotational structure of the y-ray 
spectrum. 

The symmetry of the field in which the nucleons move in a “fixed” non- 
spherical (axial) nucleus is the same as the symmetry of the field in which electrons 
move in a “‘fixed’’ diatomic molecule consisting of like atoms (the point group C..;). 
The symmetry properties of the levels of a non-spherical nucleus (and hence the 
selection rules for the matrix elements) are therefore analogous to those of the 
diatomic molecule (see QM, §119). In particular, electric dipole transitions within a 
single rotation band (i.e. without change in the internal state of the nucleus) are 
forbidden, as in a diatomic molecule of like atoms; cf. §54. Such transitions 
therefore occur as E2 or M1 transitions. In the former the total angular momentum 
J of the nucleus can change by 2 or 1, in the latter by 1. 

According to (46.9), the probability of a quadrupole transition, summed over 
values of the component M' of the total angular momentum of the nucleus in the 
final state, is 


5 
@ / / e 
Wr? > he » KJ OM Q?-m{|JOM)), 


where J is the total angular momentum of the nucleus, 1. its component along the 
axis of the nucleus, and m = M—M_’. By means of QM, (110.8) this sum can be 
expressed in terms of the squares of given quantities, the diagonal (with respect to 
the internal state of the nucleus) quadrupole transition moments Q., defined 
relative to coordinates nf moving with the nucleus. Here A = (.—’, so that in the 
case considered (Q’ = Q) only the component Qy) appears. The quantity 


eQo =e | pi (20° — ci n’) dé dyn dg = — 2e(Qo)i 


is called simply the quadrupole moment of the nucleus. Hence 


a AS (55.3) 


5 
we(QJ > 2’) = aos QAI + D( "og = ae 


Explicitly, we have 


7 7 w°? YJ? - 0?) 

wet > 9, I~) = a4 03 86 Gps es + Dy 
9) = 2 9? LEONG — 1-0" 

wetOS > 0.5 ~ 2) = FoR es 20 FHS — Near + by 


§56 The Photoelectric Effect: Non-relativistic Case 207 


The following remark is necessary concerning these formulae. They include 
matrix elements calculated with wave functions of the form 


Wrom = constant X ypD§h(n), 


where xo is the wave function of the internal state of the nucleus. These functions 
correspond to values of the ¢-component of the angular momentum which are 
definite in both magnitude and sign. In nuclei, however, states have only a definite 
parity and a definite magnitude of the angular-momentum component (usually 
taken as ()). Hence, if 1 0, the initial and final wave functions would have to be 
taken as combinations of the form 


os (Wrom + Wyo, m), 
os (byom + Wy, -o, uw). 


The products of the first terms and of the second terms will give the same value as 
above for the quadrupole moment matrix elements, but the cross-products will lead 
to non-vanishing integrals if 20 <2.t Hence formula (55.3) is not strictly valid if 
Q = or 1. In these cases the transition probability contains an additional term 
which cannot be expressed in terms of the mean value of the quadrupole moment.+ 

In a similar manner to the derivation of (55.3), we obtain for the M1 transition 
probability 


bw? J-1 1 7° 
waiOT +0, 3-1) =F" WAT -H("g ; a) 
Aw? , J7-O 


~ 3H” JQI +1)’ 


where pw is the magnetic moment of the nucleus. This formula is not valid if 0 = 3. 


§56. The photoelectric effect: non-relativistic case 


In §49-S2 we have discussed radiative transitions (with emission or absorption 
of a photon) between atomic levels of the discrete spectrum. The photoelectric 
effect differs from such a photon absorption process only in that the final state 
belongs to the continuous spectrum. 

The cross-section for the photoelectric effect can be calculated in an exact 


+ For the matrix elements of 2'-pole moments, the integrands will include products of the form 
DO%-DY, Dah. 


The angle integral will not be zero if q’ = — 20, and the range of values of q' is only from —! to +1; thus 
we must have 220 <I. 

+ This term in fact gives a significant correction only in the case 1. = >, when the coupling between 
the rotation and the internal state of the nucleus is especially large (see QM, $119). 
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analytical form for the hydrogen atom and for a hydrogen-like ion (with atomic 
number Z < 137). 

In the initial state, the electron is at a discrete level ¢; =—I (where I is the 
ionization potential of the atom) and the photon has a definite momentum k. In the 
final state, the electron has momentum p (and energy se; =e). Since p takes a 
continuous series of values, cross-section for the photoelectric effect is 


do = 2a|V,?5(-I + w — €) d’p|(2zy (56.1) 


(cf. (44.3)); the wave function of the final state of the electron is normalized to 
‘fone particle per unit volume’’. The wave function of the photon is, as before, 
normalized in the same way; in order to obtain the cross-section do, the probability 
dw then has to be divided by the photon flux density (which is c/V=c when 
V = 1), but when relativistic units are used this does not affect the form of (56.1). 

As in (45.2), we choose the three-dimensionally transverse gauge for the photon. 
Then 


a 1 
Vii =—eA- yi = eV (47) V (20) Mii 
where 
My = i w'*(a-e) e* Tw d’x, (56.2) 


with » = yy and w' = uw; the initial and final wave functions of the electron. Putting 
in (56.1) d*p +p’ d|p| do = e|p| de do and integrating to remove the delta function of 
e, we can write this formula as 


do = e? fel IMji[° do. (56.3) 


The calculations will be given in two cases, which differ as regards the magnitude 
of the photon energy: » >I and w<m. Since I ~ me*Z’<m, these two ranges 
partly overlap (when I <w <m), and so an examination of the two cases gives an 
essentially complete description of the photoelectric effect. 

We shall take first the case 


w <m. (56.4) 


The electron velocity is then small in both the initial and the final state, and the 
problem is therefore entirely non-relativistic as regards the electron. Accordingly, 
we replace a in (56.2) by the non-relativistic velocity operator ¥ = —iV/m (cf. $45). 
We can also use the dipole approximation, putting e“"~1, i.e. neglecting the 


§56 The Photoelectric Effect: Non-relativistic Case 209 
momentum of the photon in comparison with that of the electron. Then 
do = o2 mp! le : vail” do, 
27H 
(56.5) 
Vii = -+/ w'*Vib - dx. 


We shall consider the photoelectric effect from the ground level of the hydrogen 
atom (or of a hydrogen-like ion). Then 


=e (Ze*m ee —Ze2mr, 
— a e ; 


v (56.6) 


in ordinary units, me? becomes 1/a), where a) = h?/me’ is the Bohr radius. 

The wave function w’ must be taken such that its asymptotic form comprises a 
plane wave (e'?’") together with an ingoing spherical wave; cf. QM, §136, where 
this function was denoted by w<”. On account of the selection rule for |, a transition 
from an s state can only be to a p state (in the dipole case). Thus, in the expansiont 


yO > S i214 1) eR, (r)P(n- ny), (56.7) 
[=0 


where n=p/p, n,=r/r, it is sufficient to retain the term with |=1. Omitting 
unimportant phase factors, we therefore have 


y= os (n+ n,)R,1(1). (56.8) 


With the functions w and wy’ given by (56.6) and (56.8), we have 


2 5/2 
ee vy = oe | | (n> n,)(n; - e) eg ZrrR (r) do,:r’ dr 
2 \5/2 ; 
= Ween mn) (n- e) | re" Ro). 
0 


According to QM (36.18), (36.24), the radial function is (in the units employed here) 


R _ V(87r)Ze’m J 1+? 
a 3 v(l1—e*™”) 


pre ”"F(2 + iv, 4, 2ipr), 
with 
y=Ze*m[p (= Ze’/hv). (56.9) 


+ In the rest of this section, p denotes |p|. 
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The integral can be calculated by means of the formula 
| ez F(a, y, kz) dz =T(y)A* (A —k) 
0 

cf. QM, (f.3). Noticing also that 


°\ iv 
( ae | = e 2v cot! v 
v-l ; 


2° avn : e) eo cot7} p 


we obtain 


The energy of ionization from the ground level of the hydrogen atom (or a 
hydrogen-like ion) is I = Z’e*m/2. Hence 


Pe 


p p° ; 
w= 7 tl=, 7 (tv). (56.10) 


Using this relation, we obtain as the final expression for the cross-section for the 
photoelectric effect with emission of an electron into the solid-angle element do 


do =2’ ial he ® -ey’ d 56.11 
Oo — TAA hw =~ (n e) 0, ( . ) 


where a = h’/mZe’ = ao/Z (ordinary units are used here and below). The angular 
distribution of the emitted electrons is governed by the factor (n-e)’. This has 
maxima in the directions parallel to the direction of polarization of the incident 
photons, and is zero in directions perpendicular to e, including the direction of 
incidence. For unpolarized photons, formula (56.11) must be averaged over the 
directions of e, which is equivalent to substituting 


(n - e)’ > (My X ny’ 
with no = k/k; see (45.4b). 


Integration of formula (56.11) over all angles gives the total cross-section for the 
photoelectric effect: 


V9 a? I 4 e fvcot hy 
C= 3 aa’ Go) T=eu™ (56.12) 


(M. Stobbe, 1930). 
The limiting value of 0 as hw >I (i.e. as vy >&) is 
2 ar, Dr’ aad 


et C4 = gr = 0.23.a0/Z’, (56.13) 


Co 
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where e in the denominator is the base of natural logarithms. The cross-section for 
the photoelectric effect tends to a constant limit near the threshold, as it must for a 
reaction forming charged particles (see QM, §147). 

The case in which hw>TI (still with fa <mc’) corresponds to the Born 
approximation (v = Ze*/hv <1). Formula (56.12) becomes 


8 7/2 
ee a aaiz'(<*) (56.14) 


where I) = e*m/2h’ is the ionization energy of the hydrogen atom. 

The process inverse to the photoelectric effect is the radiative recombination of 
an electron with an ion at rest. The cross-section o,.. for this process can be found 
from that for the photoelectric effect (o>,) by means of the principle of detailed 
balancing (QM, $144). This principle states that the cross-sections for the processes 
i—f and f >i (with two particles in each of the states i and f) are related by 


2 = 2 
QiP | Oi--5 = QfP FO Fi, 


where pj, py are the momenta of the relative motion of the particles, and g;, 9, the 
spin statistical weights of the states i and f. Since g =2 for the photon (which has 
two possible directions of polarization), we find for the hydrogen atom ground state 


Orec ~ Oph * 2k’/p’, (56. 15) 


where p= mv is the momentum of the incident electron and k that of the emitted 
photon. 


PROBLEMS 


PROBLEM 1. Derive formula (56.14) by direct use of the Born approximation in the non-relativistic 
case. 

SOLUTION. In the Born approximation, w’ in (56.5) is simply the plane wave p’=e'?'", and w& is 
again the function (56.6). Then 


I pi 
Vii =w=— [ wey d°x 


p (Ze’my”? —Ze2mr 
= ai. = (e )p- 


The Fourier component is given by (57.6b), and so 
vp = 8V ap ?m??(Ze*)"n. 
Substitution in (56.5) and integration over do leads to (56.14) (here, with sufficient accuracy, p’?/2m ~ w). 


PROBLEM 2. Determine the total cross-section for radiative recombination of a fast but non- 
relativistic electron (I <mv’ <mc’) with a nucleus having charge Z < 137. 


SOLUTION. The cross-section for capture to the K shell (principal quantum number n = 1) is obtained 
by substituting (56.14) in (56.15): 


7 $/2 
feos wu 5). 322 Io 
3 E 
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where « =2mv~ is the energy of the incident electron, and hw ~ «. Among the other states of the 
resulting atom, only s states are important: in the calculation of the matrix element in the Born 
approximation, the important values are those of the wave function of the bound state when r is small 
(as will be seen in §57), and when | >0 these values are small compared with those for | =0. It is 
sufficient to take the first two terms in the expansion of & in powers of r. For states with | = 0 and any n, 
these terms are 


1 r 
= i 
v V (aa 5 \ a 
i.e. they contain n only as a common factor n°”; this expression is obtained by expansion of QM, 
(36.13). The total recombination cross-section is therefore 


The value of the zeta function is (3) = 1.202. 


§57. The photoelectric effect: relativistic case 


Let us now consider the case 


wo >I. (57.1) 


Here we have also ¢€=w-—I®*>I, and the influence of the Coulomb field of 
the nucleus on the wave function wW’ of the emitted electron can be taken into 
account by means of perturbation theory. We shall write 


a V 3 (u’ e+ fy), On2) 


The electron may be relativistic, and therefore the unperturbed function in (57.2) is 
written as a relativistic plane wave (23.1). 

Although the electron is non-relativistic in the initial state, its wave function w 
must nevertheless, for reasons to be explained below, include the relativistic 
correction (~ Ze’). This function is (cf. §39, Problem) 


ye ae u 
wy ~— (1 Im yoy v)| V(2m) Wron-rs (57.3) 


where Wron+ 1S the non-relativistic bound-state function (56.6), and u is the bispinor 
amplitude of the electron at rest, normalized by the usual condition tu = 2m. 
We substitute the functions (57.2), (57.3) in the matrix element (56.2):7 


Mj = mo | {2'y -e) (1 = st yy: 1) mone em 
+ p(y +e) e* wthone} d?x. (57.4) 


+ The function (57.3) has been derived for distances r~ 1/mZe’, at which the relative order of 
magnitude of the correction term is Ze’. But for the ground state (and for all s states) formula (57.3) is 
valid for any r, since the derivative of the purely exponential function (56.6), and therefore the 
correction term in (57.3), are always proportional to Ze’. This enables us to use formula (57.3) in the 
present problem, where (as we shall see below) it is the small values of r that are important. 
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In order to derive the first term of the expansion of this quantity in powers of Ze’, 
we can replace Wnon-, in the second term in the braces by the constant (Ze’m)??/V7 
simply. The first term would vanish if treated in this way when p—k #0, and it is 
for this reason that the first relativistic correction, proportional to Ze’, has to be 
included in w#. When v ~ |, this correction gives a contribution to the cross-section 
that is of the same order as the contribution of the next term in the expansion of 
Wnonr IN powers of Ze’. 

In the’ first term in (57.4) we integrate by parts, transferring the action of the 
operator V from Wpon to the exponential factor. The result is 


2 3/2 = 
My = ee faye) Lt 5 vy (p— Wd [ule pa + FAY eu} 


(57.5) 


where the vector suffix denotes the spatial Fourier component. As far as the Ze’ 
term we havet 


zemr  _ 8aZe’m 
(e Joa = (p = k)' , (57.6) 


To calculate the Fourier components w{, we write down the equation satisfied 
by the function pp: 


(y°e tiy:V—m)p = e(y"A,)u'e?’* 


Ze’ — 
= es you' eiP a 


obtained by substituting (57.2) in (32.1). Applying the operator y°e +iy-V+m to 
both sides, we find 


(At pp = — Ze(y°e +iy -Vtm)(y°u’) , iPr, 


Multiplying this equation by e "' and integrating with respect to d°x, with the 


+ Taking the Fourier component of each side of the equation 


—aAr 
(A — 07) — = —4n8(r), 


we obtain 
eS Acar 
= 57. 
(CF), -aer (57-68 
and differentiation with respect to A gives 
(0 )q= pt, (57.6b) 
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usual integration by parts in the terms containing A and V, gives 


(p? = KY? = — ZeXye—y- + m\(y'w')(Z) 


= — Ze*(2ey’—y- (k— pu _ p)” 


In the last line we have used the fact that the amplitude wu’ satisfies the equation 
(ey’—p-y—m)u'=0, or (ey°’+p-y—m)y°u' =0. 


Hence 


2ey+y-(k—p) 
=). Deno x 2-,4EY TY" p 
Wri = Wk = 47Zeil ee aaa »)(k py? : (57.7) 


Substituting (57.6) and (57.7) in the matrix element (57.5), we can write it as 


4 1/2 Ze 5/2 2 
ea ae eee 


where 


A=aly:e)t+(ye)y'(y°b)+(y:oy"(y -e), 


Pie comedies, gece a. 
(k—p) mk’—p” 2m(k— p)” 2m(k’ — p’) 


The cross-section 1S 


~Se@e me (i'Au)(iAu') do, 


w(k — 


where A = y°A*y°; see §65. This expression has to be summed over final directions 
and averaged over initial directions of the electron spin, using the rules given in $65 
below and the polarization density matrices of the initial and final states: 


p=2m(y'+1), p’=2y°e—y- ptm); 
in the initial state, p= 0 and ¢ = m. The resulting expression is 


2 5 
dg = 16e-(Ze"my Dl 5 (Ap) do. 


mw(k — p)* 


The calculation of the trace by means of the formulae (22.22) is purely algebraic, 
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and the result is 


tr(p'ApA) = - oe [ap—(b-—c)e+m)/+4m(b- e)[(e + m\(c-e)+ a(p:e)). 


the vector e is assumed real, i.e. the photon is assumed to be linearly polarized. 

The formula for the photoelectric effect cross-section will be put in its final 
form by using the polar angle @ and the azimuth ¢ of the direction of p when the 
direction of k is the z-axis and the plane of k and e is the xz plane (so that 
p:e=|p|cos ¢ sin @). When w >I, the conservation of energy may be written in 
the form ¢ — m = @ instead of e-m =w—I. We then easily see that 


k’—-p’=-2m(e-—m), (k—p)’=2e(e—m)(1—v cos 8), 
where v = p/e is the velocity of the photoelectron. A simple calculation gives finally 


vi(1—v’) sin’ 0 


_ ZS . 44,2 
be Sele Locos ey 


x {Boo COs )+|2- Da VO1~ PKG = 2-€08 0) cos? 6 | do 


(57.8) 
where r, = e7/m. 
In the ultra-relativistic case (« > m), the photoelectric effect cross-section has a 


sharp peak at small angles 9~V(1—v’), ie. the electrons are emitted pre- 
dominantly in the direction of incidence of the photon. Near the maximum 


1—vcos 6 ~3[(1— v’)+ 67], 
and the leading terms in (57.8) give 


ee (1- v 23/2 93 


da ~4Z>a i (lv? +0): dé dd. (57.9) 


The integration of (57.8) over angles is elementary but lengthy, and leads to the 
following expression for the total cross-section (F. Sauter, 1931): 


eo v- {3 v=?) ( _ 1 ao elt 
7 ema ye 3) ytd VV) ey Voy 
(57.10) 
where the “Lorentz factor” y is used for brevity: 
_ | _é&_mto 
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In the ultra-relativistic case, this formula reduces to the simple expression 
o =20Za'r2|y; (57.12) 


in the case I <wm <™m, the limit of small y — 1 in (57.10) yields the already known 
result (56.14). 


§58. Photodisintegration of the deuteron 


A distinctive property of the deuteron is that its binding energy is small in 
comparison with the depth of the potential well. This enables reactions involving 
the deuteron to be described without a detailed knowledge of the behaviour of 
nuclear forces, using only the binding energy (see QM, $133). Here it is assumed 
that the wavelengths of the colliding particles are large compared with the range a 
of the action of nuclear forces. 

This applies to the disintegration of the deuteron by y quanta having ka <1. It 
will also be assumed that pa <1, where p is the momentum of relative motion of 
the neutron and proton released; this is a stronger condition than ka <1.T 

We start from the non-relativistic formula (56.5) for the photoelectric effect 
cross-section, integrating over all directions: 


p M4n 
PM4t WY), 


TT@ 


where p is the momentum of relative motion of the proton and neutron,} and m in 
(56.5) has been replaced by their reduced mass M/2 (where M is the nucleon mass). 
The matrix element is that of the proton velocity v,, since only the proton interacts 
with the photon. Expressing v, in terms of the momentum p (v, = 3v= p/M), we 
have 


2 
3 € 
0 = pail (58.1) 


The superscript (e) denotes that this formula corresponds to electric dipole tran- 
sitions: ep/M = ev, = d, so that ep,i/M = iwd,,. 

The normalized wave function of the initial (ground) state of the deuteron 
is 


b= Vo, x= vin, (58.2) 


+ The photon energy for which pa ~ 1 (with a = 1.5 x 10°" cm) is 15 MeV. 
¢ In this section, p denotes |p]. 
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where I = 2.23 MeV is the binding energy (see QM, §133).t The wave function of 
the final state can be taken to be that of free motion, i.e. the plane wave 


y= eit (58.3) 


The reason is that, in the theory under consideration, the ‘“‘size of the deuteron” 1/k 
is assumed large in comparison with the effective interaction radius a. The 
interaction between the proton and the neutron therefore has to be taken into 
account only in S states, and can be neglected in states with 140, whose wave 
functions are small at small distances. According to the selection rules, electric 
dipole transitions between two S states (the ground state and an S state of the 
continuous spectrum) are forbidden, and it is therefore possible in this case to 
neglect the nucleon interaction in the final state. 

Integration by parts gives the matrix element 


LK o--rg 2 

Pii l a fe V d°x 
= | (—) 
~ Von Py i 
= 4qp | 
2a p°+tK” 


see the second footnote to §57. 
Using also the equation 


1 2 2 p 
mu ‘« tpy=ita =o, 


which expresses the conservation of energy, we finally obtain the photodisin- 
tegration cross-section (in ordinary units) as 


8a hh’ VI (hw — 1)?” 
Co IS et Ss 
o 3 0 a7 (ho) (58.4) 
(H. A. Bethe and R. Peierls, 1935). It has a maximum at hw = 2], and tends to zero 
as hw > I or hw >. 
The electric dipole absorption of the photon, described by formula (58.4), does 
not, however, give the main contribution to the cross-section near the photoelectric 


+ This function can be made more accurate by including a correction due to the finiteness of a, the 
normalization coefficient in (58.2) being replaced by 


Visca 


see QM, (133.13). A factor 1/(1— ax) accordingly appears in the cross-section formulae. This correction 
is in fact quite large: for the ground state of the deuteron, ax ~ 0.4. 

The deuteron ground state is °S,, with a small ‘‘admixture” of *D, due to the action of the tensor 
nuclear forces (see QM, §117). This admixture will be neglected, and therefore so will the tensor forces. 
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effect threshold (fw close to I). This is because, in this range, the principal effect 
must come from transitions to an S state, and these do not occur in electric dipole 
absorption. Nor do they occur in electric quadrupole absorption, since, although 
they do not then violate the parity selection rule, they are forbidden by the 
selection rule for orbital angular momentum (the tensor forces are here neglected, 
and L and S are therefore separately conserved). To calculate the photodisin- 
tegration cross-section near the threshold, we have therefore to consider magnetic 
dipole absorption, for which the selection rules allow transitions between S states 
(E. Fermi, 1935). 
Replacing the electric moment in (58.1) by the magnetic moment, we have 


a” = 3@Mp|piil’- (58.5) 


The magnetic moment of the orbital motion makes no contribution to py, since the 
orbital angular momentum L has no matrix elements for transitions between S 
states. The spin magnetic moment 


pp — 2 [Lp Sp + 2 LnSn 


= 2(Mp — Mn )Sp + 2u,S, 


where S=s,+5,, and py, fn are the magnetic moments of the proton and the 
neutron. When the tensor nuclear forces are neglected, the total spin is conserved, 
and its operator therefore yields no transitions. Hence 


pf = 2(Sp )fi( ep — Un). 


In the same approximation (neglecting the tensor forces), the spin and coordinate 
variables are separable. The matrix element, like the wave functions, becomes a 
product of a spin part and a coordinate part: 


ty = 2p — pen)(5pS'M'lsp|5,SM) | y'*(Pb(r) dx 


But the presence of spin-spin nucléar forces has the result that the wave equation 
for the coordinate functions w(r) includes the spin value S as a parameter. If 
S'=S, then w'(r) and w&(r) are eigenfunctions of the same operator, and are 
therefore orthogonal. Thus a photodisintegration from an initial 7S state can occur 
only to a 'S state of the continuous spectrum. 

The square ||’ in (58.5) must, of course, be averaged over components M of 
the spin S in the initial state. Thus the problem is to calculate the quantity 


1 


35 01 oY MSSM 'Is>|s/SM)P, 


where s, = 5S, =3, S=1, S’=0. The general rules for matrix elements in the 
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addition of angular momenta give 


a: ae ' 2 = i S’ Sn | 2 
OST HOS TH KsS'lsellsyS= yg sy f Ksellsollse)! 
= {spp [ls 


(see QM, (107.11), (109.3)). The reduced matrix element is 
(sp||sp]sp) = VE sp (sp + 1)(2sp + 1)] = VG/2). 


Formula (58.5) then becomes 


og” = = 30Mp (pp — Ln) {wy d’x/’. (58.6) 


The imitial function w is given by (58.2); the final function is 


1 
yp’ = 2p R,o(r). 


This is the first term (I = 0) in the expansion (56.7) of a function whose asymptotic 
form comprises a plane wave and an ingoing spherical wave; an unimportant phase 
factor has been omitted. Since the integration is taken over the region outside the 
range of action of the nuclear forces, the radial function is 


2 sin(pr + 6) 
aa aa 


R,o(r) “=< 


The phase 6 is related to the value of the virtual level (I; = 0.067 MeV) of the 
proton + neutron system when S = 0: 


cot § = k,/p, Kk, = V(MI)); 


see QM, §133. Then 


! w'* us dx = ey nye im | ¢ —kr+ipr e®d 
V kK 2’? 


= (27)? —*j im ae 
pw k —ip 


After a simple algebraic reduction, we obtain the following expression for the 
photodisintegration cross-section (in ordinary units): 


(58.7) 


oe 


im) __ 87 je VI (hoa —D\VI+V 1" 
 3fhic PR Pn ho(ho —I +1) 
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When hw > I, the cross-section tends to zero as V (hw — I), in accordance with the 
general properties of cross-sections near the reaction threshold (QM, §147). 

The inverse process to photodisintegration is radiative capture of a proton by a 
neutron. The capture cross-section (a,) 1s obtained from the photoelectric effect 
cross-section (op,) by means of the principle of detailed balancing; cf. the deriva- 
tion of (56.15). The spin statistical weights of the neutron and the proton are 
2 x 2=4; those of the deuteron (in a state with S = 1) and the photon are 3 x 2= 6. 
Hence 


(tw?  _ (tw)? 
cp? 98 2Mc*(hw — 1) 


(58.8) 


CHAPTER VI 


SCATTERING OF RADIATION 


§59. The scattering tensor 


THE scattering of a photon by a system of electrons (which will be referred to 
below as an atom) consists of the absorption of the initial photon k and the 
simultaneous emission of another photon k’. The atom may be left either at its 
initial energy level or at some other discrete energy level. In the former case the 
photon frequency is unchanged (Rayleigh scattering); in the latter case the 
frequency changes by 


w'—w=E,- kp, (59.1) 


where E, and E; are the initial and final energies of the atom (Raman scattering).t 
If the initial state of the atom is the ground state, then E, > E, in Raman scattering, 
and so w'<w: the frequency is decreased by the scattering (the Stokes case). In 
scattering by an excited atom, either the Stokes case or the anti-Stokes case 
(w' > @w) may occur. 

Since the electromagnetic perturbation operator has no matrix elements for 
transitions in which two photon occupation numbers simultaneously change, the 
scattering effect appears only in the second approximation of perturbation theory. 
It must be regarded as taking place via certain intermediate states, which may be of 
one of two types: 


(1) The photon k is absorbed and the atom enters one of its possible states E, ; 
in the subsequent transition to the final state, the photon k’ is emitted; 

(II) The photon k’ is emitted and the atom enters the state E,,; in the transition 
to the final state, the photon k is absorbed. 


In this process, the matrix element is represented by the sum 


VinWnt . VonVni 
Vai 2 (gst el ag E.— 2 gen) (59.2) 


(see QM (43.7)), where the initial energy of the atom+ photons system is ¢,= 
E,+, and the energies of the intermediate states are 


€,=E, 60 =E, ++’. 


+ In this chapter, the suffixes 1 and 2 will denote quantities pertaining respectively to the initial and 
final states of a scattering system. 
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The V_ are the matrix elements for the absorption of the photon k, and the V’. 
are those for the emission of the photon k’; the initial state is excluded from the 
summation over n, this being indicated by the prime to the summation sign. The 
scattering cross-section is 


w' do’ 


do = 27|V2| 5 (Q7 yy 9 (59.3) 


where do’ is a solid-angle element for the directions k’. The radiation energy dl’ 
scattered into the solid angle do’ per unit time is expressed in terms of the intensity 
(energy flux density) I of the incident radiation by 


= I(w'/@) do. 


We shall assume that the wavelengths of the initial and final photons are large 
compared with the dimensions a of the scattering system. All transitions will 
therefore be considered in the dipole approximation. If the photon states are 
described by plane waves, this approximation is equivalent to replacing the factors 
e'*"" by unity. Then the wave functions of the photons are (in the three-dimension- 
ally transverse gauge) 


= V (47) Bo 5 get =V (47) oe ) et 


Under the conditions considered, the electromagnetic interaction operator may 
be written as 


V=-d-E, (59.4) 


where E = —A is the field strength operator and d the dipole moment operator of 
the atom (similarly to the classical expression for the energy of a small system in 
an electric field; Fields, 842). The matrix elements are 


Vi = ~iV (27H (e+ dai), Vin = iV (271w')(e* * don). 


Substituting these expressions in (59.2), (59.3), we find as the scattering cross- 
section (written in ordinary units)tT 


dao = > {u-€ e*)(dii* e) (do, ° e)(dni-e eH oe 


Wn — w — 10 @n2+ w' — 10 


hot 4 Eat (59.5) 


n 


hon, = E, -— Ei, wo'— @® = @). 


The summation is over all possible states of the atom, including those of the 
continuous spectrum (states 1 and 2 cannot appear in the sum, since the diagonal 


+ This formula was first derived by H. A. Kramers and W. Heisenberg (1925), before the develop- 
ment of quantum mechanics. 
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matrix elements d,; and d2) are zero). The infinitesimal imaginary increments in the 
denominators correspond to the usual rule for pole avoidance in perturbation 
theory (see QM, $843): an infinitesimal negative imaginary part is added to the 
energies E, of the intermediate states over which the summation is carried out. The 
avoidance rule is important when the poles of (59.5) with respect to the variable E,, 
are in the region of the continuous spectrum; for example, if state 1 is the ground 
state of the atom, this would occur for hw exceeding the ionization threshold of the 
atom.t 
We shall use the notation# (in ordinary units) 


(Ci) 21 =; >> (di)on(dk)n1 ae (dk )on(di)nt | (59.6) 


Wri — @ — 10 @n2 + w' — 10 
where i, k = x, y, z are three-dimensional vector indices. Then formula (59.5) can be 
written as 
da = w(w + 12) | (Ci ore * ek |? do'/c'. (59.7) 
The notation (59.6) is justifiable in that this sum can in fact be represented as 


the matrix element of a certain tensor. This is most easily seen by defining a vector 
quantity b whose operator satisfies the equation 


ib + wh =d. 
Its matrix elements are 
ae dit d>, 
Dat i= Wnt Don ice ee 
so that 
(Cix Jor = (by; — diby)a1- (59.8) 


The matrix elements (c;)2; will be called the radiation scattering tensor. 

It follows from the above that the selection rules for scattering are the same as 
the selection rules for the matrix elements of an arbitrary tensor of rank two. We 
can see immediately that, if the system has a centre of symmetry (so that its states 
can be classified by parity), transitions are possible only between states of the same 
parity (including transitions without change of state). This rule is the opposite of 
the parity selection rule for (electric dipole) emission, and so there is an alternate 
prohibition: transitions allowed in emission are forbidden in scattering, and vice 
versa. 

We can resolve the tensor c;, into irreducible parts: 


Cx = CS +054 €4, (59.9) 


+ In a molecule, the threshold for dissociation into atoms here takes the place of the ionization 
threshold. 
+ Most of the results derived in §§59-61 below are due to G. Placzek (1931-1933). 
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where 
c= 3Ciis 
Ch =9(Cx + Ce) — C°Six, (59.10) 
C= (Cit =p) 


are respectively a scalar, a symmetric tensor (with zero trace) and an antisym- 
metric tensor. Their matrix elements are 


@n1 + Wn2 


(cn =3 Ga Ge no) (59.11) 
(Cian =D yd an didn + (didan(didnil ~(C)ar8u, (59.12) 
(c4)n = 2 For on m8 S See ee (59.13) 


the symbols indicating pole avoidance are omitted, for brevity. 

Let us consider some properties of the scattering tensor in the limiting cases of 
low and high photon frequencies.t 

For Rayleigh scattering (w,. = 0), the antisymmetric part of the tensor vanishes 
as w—0, because of the factor w in front of the sum in (59.13). The scalar and 
symmetric parts of the scattering tensor, however, tend to finite limits as w—-0. 
The cross-section is therefore proportional to w* when w is small. 

In the opposite case, when the frequency w is large compared with all the 
frequencies @1, ®,2 Which are important in (59.6) (but of course the wavelength is 
still much greater than a), we must arrive at the formulae of the classical theory. 
The first term in the expansion of the scattering tensor in powers of 1/w is 


© Udy)an dar ~ (don (de)nil = = (dds = dda, 


and is zero, since the operators d; and d, commute. The next term in the expansion 
is 
i 
(Cio1 = re sl [won (di )on( didnt — (di)an@ni (dk ni] 


n 


ae : 
= FAC — did,,)a1. 


Using the definition d= er (with the summation over all the electrons in the 
atom) and the commutation rules for momenta and coordinates, we obtain 


Z 2 
(Cadi = — a Sik, (cio = 0, (59.14) 


+ The case of resonance (when w is close to one of the frequencies wn; and wn) will be discussed in 
§63. 
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where Z is the total number of electrons in the system, and m the electron mass. 
Thus, in the limit of high frequencies, there remains in the scattering tensor only 
the scalar part, and scattering takes place without change in the state of the system, 
i.e. the scattering is entirely coherent (see below). The scattering cross-section in 
this case 1s 


da = r2Z’|e’* - el’ do’, (59.15) 
where r, = e”?/m. After summing over polarizations of the final photon, we have 


do = r2Z7{1—(e-n’)*} do’ 
= 72Z’ sin’ @- do’, (59.16) 


which is in fact the same as the classical Thomson’s formula (Fields, (80.7)); @ is 
the angle between the direction of scattering and the polarization vector of the 
incident photon. 

Let us consider the scattering of radiation by an assembly of N identical atoms 
situated in a region small compared with the wavelength. The corresponding 
scattering tensor is equal to the sum of the tensors for scattering by each atom. It 
must, however, be remembered that the wave functions (which are used to 
calculate the dipole moment matrix elements) for several identical atoms taken 
together are not simply equal functions. The wave functions are essentially defined 
only to within an arbitrary phase factor, which is different for each atom. The 
scattering cross-section has to be averaged over the phase factor of each atom 
separately. 

The scattering tensor (c.)2; of each atom includes a factor e''*, where ¢; and 
db, are the phases of the wave functions of the initial and final states. For Raman 
scattering, the states 1 and 2 are different, and this factor is not equal to unity. In 
the squared modulus 


le;* ek BS (cixorl’, 


where the sum is over all N atoms, the products of terms pertaining to different 
atoms will include phase factors which vanish on independent averaging over the 
phases of the atoms, and only the squared modulus of each term remains. This 
means that the total cross-section for scattering by N atoms is found by taking N 
times the cross-section for scattering by one atom; the scattering is incoherent. 

If, however, the initial and final states of the atom are the same, then the factors 
e'?1-4) = 1, The amplitude for scattering by the assembly of atoms is N times that 
for scattering by one atom, and the scattering cross-section consequently differs by 
a factor N’; the scattering is coherent.t If the atomic energy level is not 
degenerate, Rayleigh scattering is therefore entirely coherent. But if the energy 
level is degenerate, there will also be incoherent Rayleigh scattering arising from 


+ Thé factor Z’ in formulae (59.15) and (59.16) has the same origin: the cross-section for scattering 
by Z electrons in one atom is Z’ times that for scattering by one electron. 
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the transitions of the atom between various mutually degenerate states. This is a 
purely quantum effect; in the classical theory, any scattering without change of 
frequency is coherent. 

The coherent scattering tensor is given by the diagonal matrix element (cx)j1, 
and will be denoted by a, omitting for brevity the index which shows the state of 
the atom. According to (59.6), 


{Aaledes 5 Aedl tar | (59.17) 


Ont -@—-10 @ato@-—i0 


ai (w) = (Ci) = >: 


This expression may also be written as 


rn ee {- 234 +— >) | APatPedae + PidelPdar |}, (59.18) 


ma’ m  L@ai—-o-10  @,,+a—i0 


using the limiting form (59.14). Here p is the total momentum of the electrons in the 
atom. The equivalence of the two forms is easily seen by noting that the matrix 
elements of the momentum and the dipole moment are connected by 


€pin/m = 1) nin 


and using the same relationships as in the derivation of (59.14). 

If the sum and difference E;+w are not equal to any of the energy levels E,, of 
the atom (including the continuous spectrum), the terms i0 in the denominators 
may be omitted. Since p*, = pai, the tensor a,x is then seen to be Hermitiant: 


Ax = af. (59.19) 


This means that its scalar and symmetric parts are real, and its antisymmetric part 
is imaginary. The latter is certainly zero if the atom is in a non-degenerate state; the 
wave function of such a state is real, and therefore the diagonal matrix elements 
are also real. 

The tensor a, is related to the polarizability of the atom in an external electric 
field. To show this relation, let us calculate the correction to the mean value of the 
dipole moment of the system when the latter is placed in an external electric field 


2(E e'*' + E* ec’), (59.20) 


This can be done by using a well-known formula of perturbation theory (QM, 840). 
If the system is subjected to a perturbation 


V = Fe i a Ft en. 


+ This result depends on the neglect of the natural line width, and therefore of the possible 
absorption of the incident radiation; see §62. 
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then the first-order correction to the diagonal matrix elements of a quantity f is 


(0) (0) (0) Fx (0) Ty * 
f(t) ane Ss {| finFn che niL'in | eg ivt a eraser ae bade a eh 


= Wnt -~o-10 a ytwtid Onto -10 @);,—at+10 


The perturbation V must be regarded as being applied with infinite slowness from 
t = —, so that in the first term w is to be interpreted as w+ i0, and in the second 
term as w—i0; the imaginary increments in the denominators have been written 
accordingly. 

In the present case 


F=-1td-E, 


and the correction to the diagonal matrix element of the dipole moment is found to 


be 
di? =3(de + d* e), (59.21) 
where d is a vector whose components are 
di = a E,. (59.22) 


The expression for the tensor a P'(w) differs from (59.17) for a; by a change in the 
sign of the imaginary part in the denominator of the second term. By definition, 
a(w) is the polarizability tensor of the atom in a field of frequency w. For 
frequencies such that the rmaginary parts in the denominators can be omitted and 
the tensor a,x is Hermitian, a; and of) are identical. In particular, when w = 0 the 
formula (59.22) becomes QM, (76.4), and the expression QM, (76.5) for the static 
polarizability tensor is the same as a;,(0) from (59.17). Note also that, if state 1 is 
the ground state,? all w,; >0 and the avoidance rule in the first and second terms in 
(59.17) is important only when w >0 and w <0 respectively. In that case, 


ax(w) = a{f(\o)). (59.23) 


The formulae of scattering theory implicitly have w > 0; the tensor ax is then the same 
as the polarizability tensor. 

We shall need not only the cross-section but also the photon scattering am- 
plitude f. As usual in perturbation theory, this is equal, apart from a normalization 
factor, to minus the matrix element (59.2). Choosing this factor so as to express the 
cross-section (59.7) in the form do =|f|? do’, we have as the elastic scattering 
amplitude 


f = w’ ae; * ex. (59.24) 
According to the optical theorem (see (71.10) below), the imaginary part of the 


+ Only this case (which will be assumed henceforward) allows a completely rigorous treatment, 
because of the finite lifetime of the excited states; see §62 below. 
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forward scattering amplitude (without change in momentum and polarization) 
determines the total cross-section o; for all possible elastic and inelastic processes 
for a given initial state of the photon: 


a, = (47/w) im(w*axeek ) = 470 (ax — a &)e* e;,/2i. (59.25) 


Thus the total cross-section is determined by the anti-Hermitian part of the 
scattering tensor. 

The formula (59.25) has a simple classical significance. The electric field E does 
work Sev: E=E-d on the system of charges per unit time. Expressing the field in 
the form (59.20), and the dipole moment in the form (59.21), (59.22), and averaging 
this work with respect to time, we find 


1 : 
30|E|e*e,(aix os a )/2i, 


with E = eE. On the other hand, if E is the incident radiation field, the mean energy 
flux density in it is |E|’/87, and the energy absorbed by the atom is |E|’o,/87. 
Equating the two expressions, we find (59.25). 

If the angular momentum J of the ground state of the atom is zero, then by 
spherical symmetry ax = ad, and 


oO; = 470 ima. (59.26) 


For a system having angular momentum, a similar relation holds for quantities 
averaged over the spatial directions of the angular momentum (see 860). 

For photon energies above the ionization threshold of the atom, the principal 
contribution to the total cross-section o; comes from the ionization process (the 
absorption of a photon in the photoelectric effect). The scattering cross-section is a 
quantity of higher order in e’; compare, for instance, (56.13) and (59.16). 

If, however, the photon energy is below the ionization threshold (but not too 
close to resonance, 1.e. to any of the discrete excitation frequencies of the atom), 
then the cross-section (which in this case reduces to the scattering cross-section), 
and therefore the imaginary part of the amplitude, are of a higher order of 
smallness than the real part of the amplitude. Neglecting the former, we again 
obtain (59.19). The situation is different in the neighbourhood of resonance, where 
the cross-section increases; this case will be discussed in §62. 

As well as scattering, the two-photon processes which occur in second-order 
perturbation theory also include double emission, i.e. the simultaneous emission of 
two quanta by an atom. 

The expression for the probability of this process differs from (59.5) only by the 
changes w > —w, e>e* (emission of a photon w, instead of absorption) and by the 
extra factor 


d’k/Qa) = w’? dw do](27r)’, 


the number of quantum states of the emitted photon in given ranges of the 
frequency w and the directions of k; the frequency of the second photon is 
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determined from w by the equation w + w’ = w;2. The emission probability per unit 
time is thereforet 


3. 73 
dw = |(by)ne!* et eesvak do do’ dw, (59.27) 


where 


(Dix )o1 = » Precareer hare a 


WOito@-10 a.+o'—10 


differs from (cx)2; in (59.6) only by a change in the sign of w. Summing this 
expression over the polarizations of the photons and integrating over their direc- 
tions of emission,£ we obtain 


7 Siw” 


anes 9mh*c® 


(Dix )o1|7 dow. (59.28) 


The probability of the emission of two photons w and w’ is usually very small in 
comparison with that of the emission of a single photon with frequency w+w’. An 
exception occurs in cases where the selection rules forbid the latter process but 
allow the former, such as transitions between two states with J =0, where all 
processes of single-photon emission are strictly forbidden. Another example is the 
transition from the first excited state (2s!) of the hydrogen atom to the ground state 
(1s!), which is forbidden for both El and M1 radiation; see §52, Problem 2.§ 

If the atom is in the field of an incident flux of photons w, k, there is not only 
spontaneous double emission, with the probability (59.27), but also induced double 
emission, in which the field causes emission of a similar photon and a photon w’, k’. 
The probability of this process differs from that of spontaneous emission by a 
factor Nxe, the number density of incident photons with given k and e. The incident 
photon flux density is 


dI = CNxe d°k/(2r)’ = Nue(w?/8 77°C’) dw do. 


Expressing Nyx. in terms of dI and dividing the probability of the process by dI, we 
obtain the cross-section 


3 
do = Foca (bu) e* et? do’. (59.29) 


Similarly, if the atom is in a field of photons w’, k’, the incidence of a photon oa, k 
causes induced Raman scattering, whose cross-section is proportional to the 
density of photons w’, k’. 


+ In the rest of this section, ordinary units are used. _ 

+ This operation amounts to complete averaging over the directions of e by eet =35x, followed by 
multiplication by 2x2 4a x 4m. 

§ The lifetime of the 2s! level for double emission is 0.15 sec. 


230 Scattering of Radiation §59 


The calculation of the tensors (cx)i2 or (bx)12 for specific atoms requires that of 
sums of the form 


(Mo) = LE (dj — =a (59.30) 


with E taking the values E,+hw or E,+hw'. To simplify the notation, let us 
discuss a hydrogen atom. We write the sum (59.30) as the integral 


M®)s, = | W(t) dG(r, 1’) dir’) dx dx’, (59.31) 
where 
py aM) 
G(r,0'; E)= > eae (59.32) 


Let the operator H — E, where H is the Hamiltonian of the atom, act on the 
function G. Since Hw, = E,W, we obtain 


(H — E)G = 2 Un(r yr (r’). 


The sum is the delta function 5(r—r’), since the set of functions w, 1s complete. 
Thus the function G satisfies the equation 


(H — E)G(r,r’; E)= 5(r-r’), (59.33) 


i.e. it is the Green’s function of Schrédinger’s equation; the avoidance rule in 
(59.32) decides which solution of this equation is to be taken. Thus the problem of 
calculating the sum (59.30) reduces to finding the Green’s function of the atom. An 
exact solution of equation (59.33) is, however, possible only if we know the exact 
solutions of the homogeneous Schrédinger’s equation, i.e. in practice only for the 
hydrogen atom.t 


PROBLEM 


Calculate the probability of elastic scattering of a (non-relativistic) electron by an almost mono- 
chromatic standing light wave (P. L. Kapitza and P. A. M. Dirac, 1933). 


SOLUTION. The standing wave may be regarded as a combination of photons with momenta k and 
—k (and equal polarizations). The scattering of the electron may be regarded as the absorption of a 
photon k and induced emission of a photon —k, so that the electron momentum p is changed by 2hk and 
rotated (without change of magnitude) through an angle 6 such that |p| sin3@ = hw/c. The probability of 
this process can be obtained from the Thomson scattering cross-section (59.15), 


da = rile’* + el? do' 
=r do', 
+t See L. Hostler, Journal of Mathematical Physics 5, 591, 1964. The application of this Green’s 


function to calculate the scattering amplitude for the hydrogen atom is given by Ya. I. Granovskii, 
Soviet Physics JETP 29, 333, 1969. 
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by multiplying by the flux density of photons with momentum k and the number of photons with 
momentum —Kk. 

The flux density of photons having frequencies in the range dw is CU... dw/2hw, where U., dw is the 
energy density in the standing wave in the spectrum interval dw; the factor 3 appears because the energy 
of the wave is equally divided between the photons moving in opposite directions. The momenta k of all 
the photons forming the standing wave are parallel to a certain direction n (the “‘direction” of the 
standing wave). In other words, the energy density as a function of the frequency and direction of the 
photons n’ is Uow = U,.6(n' — n). Accordingly, the number of —k photons is 


cf. (44.8). The electron scattering probability per unit time is then found to be 
us 2°e* 2 
w= aot | Ly dw. 


The factor w“* is taken outside the integral, since the non-monochromaticity Aw is assumed small. The 
value of the integral is inversely proportional to Aw (for a given total intensity). 


$60. Scattering by freely oriented systems 


If an atomic energy level is not degenerate, the polarizability and intensity of 
coherent scattering are determined by the same tensor ax =(cx)1;. If the level is 
degenerate, however, the observed values of these quantities are averaged over all 
states belonging to the level in question. The polarizability must be defined as the 
mean value 


Qik = (Ci 11. 


The observed scattering intensity is determined by the mean values 


(Cix )11(Cim)11- 


The relation between the polarizability and the scattering is therefore more in- 
direct. 

Although each of the quantities (c,.);; may be complex, their mean values (in the 
absence of absorption, with a;, an Hermitian tensor) are real, since on averaging we 
can choose arbitrarily the set of independent wave functions (corresponding to a 
given degenerate level), and we can always ensure that all the functions are real. 

For free atoms or molecules (not in an external field), the degeneracy of levels 
is usually due to an angular momentum which is freely oriented in space. Let the 
initial state in scattering have angular momentum J), and the final state J>. As usual, 
the scattering cross-section must be averaged over all values of the component M,, 
and summed over the values of M>. After the averaging, the cross-section is 
independent of M;, and the summation is therefore equivalent to multiplying by 
2J,.+1. Thus the averaged scattering cross-section is 


deg = wa" c §) e!*e,e;e% do’, (60.1) 
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where 


CO). 


Cikim = rT, iy 2, (Cik )o1(Cim) 5 


= (2J,+ RONA Ce : (60.2) 


the bar with index 1 signifies averaging over M,. 

For Rayleigh scattering, states 1 and 2 belong to the same energy level (w,2 = 0). 
If only coherent scattering is considered, then states 1 and 2 must coincide 
completely, so that M; = M>. In that case the summation over M>, and hence the 
factor 2J.+ 1 in (60.2), no longer appear: 


Citim = (Can)ii(Cim) "1 - (60.3) 


The result of the averaging can be written down without further calculation by 
using the fact that averaging over M, is equivalent to averaging over all orientations 
of the system, after which the mean value can only be expressed in terms of the 
unit tensor 6,, and the only non-zero mean values are those of products of 
components of either the scalar, the symmetric or the antisymmetric part of the 
scattering tensor; it is clear that the unit tensor cannot yield expressions with the 
symmetry properties of cross-products. Thus 


C fim = G1 5ixSim + C fkim + C Yet’ (60.4) 
where 
Toy 13! 
G3, = (2J,+ D\(c al’ 
ok ee ek 
C im = (2I2 + I(Cik)ailCim)51 5 (60.5) 
a ee 
Cikim: = (25. + W(ch)a(Cim) ii - 
The scattering cross-section (and therefore the scattering intensity) for a freely 
oriented system is therefore a sum of three independent parts, which will be 
referred to as scalar, symmetric and antisymmetric scattering. 

Each of the three terms in (60.4) can be expressed in terms of one independent 
quantity: the scalar scattering is expressed in terms of G%,, and for the symmetric 
and antisymmetric scattering we have 

CKin = 16G51(Sp5km + Simdit — 35;e51m), 
Gy = (2J.+ I(c§dn(e dar 5 

chee = = 6G 51(8:15iem ~~ SimOnt)s 

G3, = (252+ 1)(ci)ai(e i. , 


(60.6) 


the combinations of unit tensors are derived from the symmetry properties, and the 
common factor is then found by contracting with respect to the pairs of indices i, | 
and k, m. 
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On substituting (60.4)—(60.6) in (60.1), we obtain for the scattering cross-section 


dé = wo" {GS,\e* + el? + 15G5C1 + le’ - ef? — Fe - el?) +6G4,(1 — |e’ « e|7)} do’. 
(60.7) 


This formula shows explicitly the angular dependences and polarization properties 
of the scattering. 

The total cross-section for scattering in any direction, summed over the 
polarization of the final photon and averaged over the polarization and direction of 
incidence of the initial photon, is easily obtained directly from (60.1) by noting that 


Tea Vl 
ee, = 30ix 


if the averaging is over both the polarization and the direction of propagation of the 
photon; summation over these would give a corresponding result larger by a factor 
2X 4a. The result is 

_ 8 

5 =F wwf 


9 


= = wow" (3GS; + G5, + G4). (60.8) 


It has already been mentioned that the selection rules for scattering are the 
same as those for the matrix elements of an arbitrary tensor of rank two. Because 
of the separation of the scattering intensity into three independent parts, it is 
convenient to state the rules for each part separately. 

The selection rules for symmetric scattering are the same as those for electric 
quadrupole radiation, since the latter is likewise determined by an irreducible 
symmetric tensor (the quadrupole moment tensor). For antisymmetric scattering, 
the selection rules are the same as those for magnetic dipole radiation, since both 
are determined by an axial vector (an antisymmetric tensor is equivalent, or dual, 
to an axial vector). There is a difference here, however, in that the diagonal matrix 
elements, which in the case of emission give the mean values of the electric or 
magnetic moments (and do not correspond to radiative transitions), are important 
in the case of scattering, since they relate to coherent scattering. 

For scalar scattering the selection rules are the same as those for the matrix 
elements of a scalar. This means that only transitions between states of the same 
symmetry are possible. In particular, the values of the total angular momentum J 
and its component M must be the same, and the matrix elements diagonal in M are 
independent of M; see QM, (29.3). For Rayleigh scattering, therefore, states 1 and 
2 must coincide completely (as regards M as well as energy), and so scalar 
Rayleigh scattering is entirely coherent. Conversely, since in scalar scattering all 
states always combine with themselves, it follows that in coherent scattering there 
is always a scalar part. 


+ This refers, of course, to the selection rules based on symmetry, and not due to the specific form 
of the axial vector in the case of emission; the magnetic moment vector includes a spin part, whereas in 
scattering we have the matrix elements of orbital (coordinate) quantities. 
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For a system freely oriented in space, the polarizability tensor also must be 
averaged over the directions of the angular momentum J;, in the same way as the 
scattering cross-section has been averaged above. The averaging is very simply 
carried out: we evidently have 


——,} —7—1 
ain = (Cx) = (C1 Sx. 


The symmetric and antisymmetric parts of the scattering tensor vanish on 
averaging, since 6, is the only isotropic tensor of rank two. 

It has been mentioned that the diagonal matrix elements of a scalar are 
independent of M;. The mark of averaging of (c°),, may therefore be omitted, and 
this quantity calculated for any M,, so that the polarizability is 


ain = (C°) 15x. (60.9) 


For the same reason, the averaging sign may be omitted in the quantity G?;, which 
determines the scalar part of the coherent scattering: 


G8, =(( nF = (ci; (60.10) 


the factor 2J,+1 is omitted in accordance with (60.3). Thus there is a 
simple relation between the mean polarizability and the scalar part of the coherent 
scattering: both are determined by the quantity 


(C11 = 3 Do ae an \dial’. (60.11) 


PROBLEMS 


PROBLEM 1. Find the angular distribution and the degree of depolarization in the scattering of 
linearly polarized radiation. 

SOLUTION. Let @ be the angle between the direction of scattering n’ and the direction of 
polarization e of the incident radiation. The scattered radiation has two independent components, 
polarized one in the plane of n’ and e (intensity I;) and one perpendicularly to this plane (intensity I); 
the degree of depolarization is I/I;. The intensities I; and Ib are given by (60.7) with the appropriate 
directions of e’. 


In scalar scattering, the radiation remains completely polarized in the same plane (I2=0), and the 
angular distribution of intensity is 


I =3sin’ 6. 
Here and below, the expressions for I = I,;+ I, are normalized so as to give unity on averaging over 
directions. 
In symmetric scattering 
= 39(6 + sin’ 6), L/T, = 3/G + sin’ 0). 
In antisymmetric scattering 


I=i0+cos’6), = bi, = 1/cos’ 6. 


PROBLEM 2. The same as Problem 1, but for the scattering of natural light. 
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SOLUTION. Formula (60.7) can be applied to natural (unpolarized) incident light by the substitution 
eet > 3(Six — nunk), 


which corresponds to averaging over the direction of polarization e with a given direction of incidence n. 
The scattered light will be partly polarized, and from considerations of symmetry it is evident that its 
two independent components will be linearly polarized in the scattering plane of n and n’ (intensity Ij) 
and perpendicularly to this plane (intensity I,). The scattering angle between n and n’ will be denoted by 


| For scalar scattering 
I=I,+h=i1+cos’ 9), Ii/1. = cos’ 9. 
For symmetric scattering 
I =%(13+cos’ 9), ‘y= (6+ cos’ 9)/7. 
For antisymmetric scattering 
1=3(2+sin’ 9), =/I.=1+ sin’ 0. 
PROBLEM 3. For scattering of circularly polarized radiation, determine the reversal factor (the ratio 


of the intensity of the component circularly polarized in the ‘reverse’ direction to that of the 
component polarized in the original direction). 


SOLUTION. For circularly polarized incident radiation, the angular distribution and the degree of 
depolarization (J\/I.) are the same as in the scattering of natural light. 

Let the vector e of the incident radiation have components (1//2)(1, i, 0) in coordinates such that the 
xz-plane is the scattering plane and the z-axis is along n. Then the polarization vectors for the reverse 
and original circularly polarized components of the scattered radiation are 


ay xferass ae | ts 
e' = 75 (cos 9, i, —sin 3) and e' = 75 (cos 0, i, ~sin D). 


Calculation of the intensity by means of (60.7) gives the reversal factors P for the three types of 
scattering: 


; i , 134+cos’? 8 +10cos 8 a 1—cos*79 
= 2 b) = ] bd] = © 41 bd] 
any, : 13+ cos’ 3-10 cos 3 ‘s 1—sin 33 


where @ is the scattering angle. 


PROBLEM 4. Calculate the cross-section for scattering of a low-frequency photon by a hydrogen 
atom in the ground state. 


SOLUTION. A low-frequency photon can undergo only elastic scattering. Since the orbital angular 
momentum | of the hydrogen atom in the ground state is zero, the selection rules (neglecting the spin-orbit 
interaction) allow only scalar scattering. The static polarizability of the atom is (in ordinary units) 
a = (9/2)(h?/me’)’; see QM, $76, Problem 4. Substitution in (60.8) gives the required cross-section: 


ot = 542(w/c)‘(h?/me’)’. 


PROBLEM 5. Calculate the cross-section for elastic scattering of y rays by a deuteron (H. A. Bethe 
and R. E. Peierls, 1935). 


SOLUTION. The wave functions of the deuteron ground state and of its continuous-spectrum states 
(the dissociated deuteron) are 


wo= Vx, p= et, «= VMN: 


Qn or’ 


see (58.2), (58.3). The matrix element of the dipole moment is dpo = —ieppo/ Mw o and has been calculated 
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Amie | kK  p 


dpo =~ Moyo V 277 k* + p”’ 


with the frequencies wpo = (p+ k’)/M. 
The polarizability tensor is 


| d°p 7 e 5, 
au = {5 5 7 laos! Oy sor }5™ 
The first term is due to the virtual excitation of the internal degrees of freedom of the deuteron, and is 
written in the form (60.11). The second term is due to the action of the wave field on the translational 
motion of the deuteron as a whole. Since this motion is quasi-classical, the corresponding part of the 
scattering tensor is given by (59.14), with m replaced by the deuteron mass 2M. 

The calculation of ax depends on that of the integral 


: 4 
d 
=| Gepiecyp TPs y= Malk?= ol 


We have 


where 


Jai z* dz 
2 74 I+ DT 


When y <1, the integrand has poles at the points iA, iV(1+ y), iV(1 — y) in the upper half-plane of the 
complex variable z; the integral Jy can be calculated from the residues at these poles. The result is 


3/2 3/2 
Janey 4 Ley) (= +a)}. 
2 2y 2y By 


The total scattering cross-section is expressed in terms of ax by (60.8), and is (in ordinary units) 
ae Sz ( e ) 

3 \Mc? 
For y > 1 the scattering amplitude (above the deuteron dissociation threshold) is found from that for 


y <1 by analytical continuation; it has an imaginary part, which must be positive (in accordance with the 
avoidance rule in (59.17)): 


ee: 2 3/2 aryl? 
1 a a Lye] 


for y = hw/I <1. 


Sz ( e ) 
3 \Mc? 
When y > 1 we have o = (87/3)(e"/Mc’)’, which agrees, as it should, with (non-relativistic) scattering by 


a free proton. 
The angular distribution of radiation is 


: 2 
ice 3/ ys 


4 2 de 
Pg tiga yt) Tae for y> 1. 


do = 0 -a(1+ cos’ 0) do/4n, 
where @ is the scattering angle. If the scattering amplitude is defined by (59.24), we have 


(y Oy Le? 


im f(0) =e ae for y > 1. 


According to the optical theorem (59.26), this quantity must equal wo./4a, where o; is the total 
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cross-section for photodissociation (58.4). The elastic scattering cross-section is of a higher order (~ e’) 
than the dissociation cross-section (~e’; see (58.4)), and therefore a; is equal to the dissociation 
cross-section. For the same reason, in the approximation considered, the scattering amplitude was found 
to be real for y <1 (.e. below the dissociation threshold). 


§61. Scattering by molecules 


The specific properties of molecular scattering are due to the same properties of 
molecules as form the basis of the theory of molecular spectra, namely the 
possibility of treating separately the state of the electrons with the nuclei fixed and 
the motion of the nuclei in a given effective field of the electrons. 

Let the frequency w of the incident radiation be less than the energy w, of the 
first electron excitation. Then the electron terms will not be excited in the 
scattering process. The scattering will be either Rayleigh scattering, or Raman 
scattering due to the excitation of rotational or vibrational levels. 

Let us further assume that the electron ground term of the molecule is not 
degenerate (and has no fine structure). That is, we assume that the total spin of the 
electrons and the component of their total orbital angular momentum along the axis 
of the molecule (for molecules of the symmetrical-top type) are both zero. 
For diatomic molecules this means that the electron ground term must be ‘DY. 
These conditions are known to be satisfied for the ground states of most mole- 
cules.t 

Finally, we shall assume the frequency w large compared with the intervals in 
the nuclear (rotational and vibrational) structure of the ground term, and the 
difference w,— w to be in a similar relation to the nuclear structure of the excited 
term. Thus the frequency of the incident radiation must be sufficiently far from 
resonances. These conditions make it possible, in calculating the scattering tensor, 
to ignore at first the motion of the nuclei and to discuss the problem with a given 
configuration of the nuclei. 

In such a problem, the scattering tensor is the same as the polarizability tensor, 
Qik = (Cix)11, and can in principle be calculated from the general formula (59.17), in 
which the summation is over all excited electron terms. The quantities a; thus 
obtained will be functions of the coordinates q of the nuclear configuration (the 
energies and wave functions of the electron terms depend on these coordinates as 
parameters). Since the state is not degenerate, the tensor ag(q) is real, and 
therefore symmetrical. 

The tensor ax(q) is the electronic polarizability of a given nuclear configuration 
in the molecule. To solve an actual problem of scattering, we have also to take into 
account the motion of the nuclei in the initial and final states. Let W,,(q) and w,,(q) 
be the nuclear wave functions of these states, s; and s2 being the sets of vibrational 
and rotational quantum numbers. The required scattering tensor is the matrix 


+ The results given below are, however, valid (to a certain approximation) also for cases where 
degeneracy of the electron ground term is due to a non-zero spin, the spin-orbit interaction being small 
(so that the resulting fine structure may be neglected). In this approximation, states with different spin 
directions do not combine, and in this sense they behave as if they were not degenerate. The molecule 
On, with ground term *S, is of this type. 


QE4-Q 
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element of the tensor a;(q) with respect to these functions: 


(srlaulsi) = [ W8(@)ax(aia(4) da. (61.1) 


Because the tensor a,x(q) is symmetrical, so is the tensor (61.1) (whether s,; and s, 
are the same or not). Thus we conclude that, under the conditions stated, there will 
be no antisymmetric part in either Rayleigh or Raman scattering. The scattering 
will include only scalar and symmetric parts. 

The scalar part a°(q) of the polarizability is independent of the orientation of 
the molecule, and depends only on the internal configuration of the atoms within it. 
Let v denote the set of vibrational quantum numbers of the molecule, and r the set 
of rotational numbers other than the magnetic number m. Then the matrix elements 
are 


(vorzm|a°|viTmM1) a (vo]a°|01) 8,7, 51m, mp: (61.2) 


The diagonality with respect to the numbers r and m is true of any scalar. The 
particular property of (61.2) is that here the elements do not depend on these 
numbers at all. Thus the scalar scattering occurs only for purely vibrational 
transitions and does not depend on the rotational state. 

The symmetric scattering is determined by the matrix elements of the tensor 
ai,. Its components in a fixed coordinate system xyz are expressed in terms of the 
components a}, in a system nf moving with the molecule by 


Oi = > ai Di Dy; (61.3) 


where the D,;; are the direction cosines of the new axes relative to the old. The 
quantities a}, do not depend on the orientation of the molecule, and the D;; do not 
depend on the internal coordinates. Hence 


(v2r2mM|a i017) = >} (v2|@},01){r2M|DyiDex|rim)). 
I, 
The sum of the squared moduli of these quantities over r2, m2, i, k is easily seen to 


bet 
2X  ((voramalailoirim)? = [Cvsla ilo? (61.4) 


rp, M> I, 


+ In transforming the sum we use the equation 


2 > <rim|DuDxg|r2m2)(r2ma| Dir Dyg|rimi) 


»K 62, m2 


= (rim p> DiaDigDiv Dre! 


rim) = (rym |S Sge|rim1) = Su Sg¢', 


which expresses the unitarity of the matrix Dx. 
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This means that the total intensity of scattering with transitions from a given 
vibrational—rotational level v,, r; to all rotational levels of the vibrational state v2 is 
independent of 1. 

For molecules of the symmetrical-top type, we can go further and derive a 
relation between the scattering intensity and the rotational quantum numbers for 
every transition v,r;— v2r. In this case the numbers r are the angular momentum J 
and its component k along the axis of the molecule. We replace the Cartesian 
components of a}, by the corresponding spherical tensor of rank two, denoting its 
components by a, (A = 0, +1, +2). According to QM, (110.7), the squared moduli of 
its matrix elements are 


|(v2J2k2m|a,|v,J.kym,)|" 


= J, 2 ny( Jn 2 Si ) = 2 
=Qh+DAR+D( 2X Cae 1 sg,) Kealantook, 
where @,(q) is the spherical polarization tensor relative to axes fixed in the 


molecule, and A’ = k,—k,. Summing over m, and A = m.—m, (with m fixed), we 
obtain (cf. QM, (110.8)) 


Ir 2 Ff . 
Ds (v2Jakym.|o,|v,Jikim)| = (22+ D(_ ke; . <)Kealaloo?. (61.5) 
mM, 


This quantity determines the intensity of scattering with the vibrational—rotational 
transition v,J,k,—> v,J,k2. Since the matrix elements (v2|@,|v;) do not depend on the 
rotation of the molecule, this also defines the dependence of the intensity on J,, J, 
and on k,, k2. The right-hand side of (61.5), it may be noted, involves only one 
spherical component of the polarizability tensor. 

Summation of (61.5) over J, and k, givest 


> » (v2Jok2m2|e, |v ,Jikym,)\ a > Kvajay 0.1’, 


A Jo, Kp, m2 d! 


and we return to the sum rule (61.4). 

A special case of the symmetrical top is the rotator, a linear molecule (or, as a 
particular instance, a diatomic molecule). The angular momentum component along 
the axis of such a molecule is zero (in a non-degenerate electronic state with zero 
electronic orbital angular momentum).¢ In this case, therefore, we must put 
k, = k, = 0 in (61.5). 

Finally, let us consider the question of the selection rules in vibrational Raman 


+ In the summation over Jz with given k;, X’ (and k7 = ki+ X’), we have 
Jn 2 a) 
= 1 
> Qh+ Dba a 


according to QM, (106.13). The summation over k2 (or, equivalently, over A’ = k2— ki) is then effected 
for given ki. 

+ Here we do not include effects due to the interaction between the vibrations and the rotation of the 
molecule (see QM, §104). 
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scattering, together with the cognate question of vibrational emission (or ab- 
sorption) spectra of molecules.t 

For scattering, the problem is simply to find the conditions under which there 
are non-zero matrix elements of the tensor a,(q) with respect to the vibrational 
wave functions wW,(q); the scalar a’ (for scalar scattering) and the irreducible 
symmetrical tensor ai, (for symmetric scattering) have to be considered separately. 
A corresponding role in emission (or absorption) is played by the matrix elements 
of the vector d(q), the dipole moment of the molecule averaged over the electronic 
State with a given position of the nuclei. This has already been stated in $54 for 
diatomic molecules. 

The vibrations of a polyatomic molecule are classified according to types of 
symmetry, the irreducible representations D, of the corresponding point group, 
where a numbers the representation (see QM, 8100). These representations also 
define the symmetry of wave functions of vibrational states of the molecule (see 
QM, §101). The symmetry of the wave functions of the first vibrational state 
(quantum number v, = 1) is the same as the symmetry D, of the vibration type; the 
symmetry of the higher states (v, > 1) is given by the representations [D7], which 
are symmetric products of v, representations D,. Finally, the symmetry of states in 
which different vibrations a and b are simultaneously excited is given by the direct 
product [D*%] x [D}*].4 The selection rules for the various quantities (scalar, vector, 
tensor) with respect to types of symmetry are found as described in QM, §97. 

The selection rules resulting from the symmetry properties of the molecule are 
rigorous. There are also approximate rules based on the assumption that the 
vibrations are harmonic and that the functions a,(q) or d(q) can be expanded in 
powers of the vibrational coordinates q. These are a consequence of the known 
selection rule for a harmonic oscillator, according to which the matrix elements of 
the oscillator coordinate q are zero except for transitions in which the change in 
the vibrational quantum number Av = +1. 


§62. Natural width of spectral lines 


So far, in the study of emission and scattering of radiation, we have regarded all 
the levels of the system (an atom, say), as being strictly discrete. But in fact excited 
levels have a certain probability of decay by emission, and therefore a finite 
lifetime. According to the general principles of quantum mechanics, this has the 
result that the levels become quasi-discrete, with a certain small but finite width 
(see QM, §134); they can be written in the form E —3iI’, where I'(=I/h) is the total 
probability (per unit time) of all possible processes of “decay” of the state 
concerned. 

Let us consider how this situation affects the process of emission (V. Weisskopf 
and E. Wigner, 1930). It is evident that, because of the finite width of the levels, the 
emitted radiation will not be strictly monochromatic: its frequencies will be spread 


+ These spectra lie in the infra-red, and are usually observed as absorption spectra. 

+ The symmetry properties of the vibrational wave functions are, of course, independent of the 
specific form of the vibrational potential energy, and in particular are independent of the assumption 
made in QM, §101, that the vibrations are harmonic. 
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over arange Aw ~ I (=I'/h). But, in order to measure the frequency distribution of 
the photons with this accuracy, the time needed is T > 1/Aw ~ 1/I. During this time 
the level will almost certainly decay by emission. We therefore have to deal with 
the determination of the total probability of emission of a photon of a given 
frequency, not with the probability per unit time. We shall calculate this total 
probability, first of all, for a transition of an atom from some excited level E, —3iT, 
to the ground level E,, which has an infinite lifetime and is therefore strictly 
discrete. 

Let VY be the wave function of the atom and the photon field, and H = H®+V 
the Hamiltonian of the system, where V is the atom-field interaction operator. We 
shall seek a solution of Schrodinger’s equation 


“ = (H+ vy (62.1) 


in the form of an expansion in terms of the wave functions of the unperturbed 
States of the system: 


v= > a(tyv?= > a(t) e y®. (62.2) 
For the coefficients a,(t) we obtain the equations 
j a => (o|V|v")a, expli(é, — 8,)t}. (62.3) 


Let |v) be a state with energy 6, = E,+., in which the atom is at the ground 
level E, and there is one quantum with a definite frequency w; this state will be 
symbolized by |w2). At the initial instant, the system is in the state |1), the atom 
being excited to the level E,, with no photons present. Thus, for t = 0 we must have 


a,=1, a,=0 for |v')¥|I1). (62.4) 


The solution of equation (62.3) with this initial condition will give (with the 
appropriate normalization of the wave functions) the probability that at time t there 
has been a transition 1-2 of the atom with emission of a photon in the frequency 
range dw: it is |a,2(t)|? dw. We are interested in the ultimate probability as t > ~: 


dw = |a,2(~)| dw. (62.5) 


In order to clarify the problem, it may be recalled that, in finding the ordinary 
emission probability (per unit time) with a transition 1-2 (neglecting the level 
width), equation (62.3) has to be solved with all the a,(t) on the right-hand side 
replaced, to a first approximation, by the values (62.4). The solution thus obtained 
is then examined for large t; cf. QM, §42. We can now describe this procedure 
more precisely; it relates to times short in comparison with the lifetime of the 
excited level, and the large values of t concerned are large compared with 
1/(E; — E2) but small compared with 1/T,. 
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In our present case, where times comparable with 1/I, are considered, the 
function a,(t) decreases in time according to 


a(t) =e", (62.6) 


The functions a,(t) for states |v’) which can result from emission by the atom 
increase with time, however. If the transition from a given level FE, can occur to 
various atomic levels (as well as to E>), there will be many increasing functions 
a,(t), each corresponding to a state in which the atom is at a certain level and there 
is one photon with the appropriate energy. Nevertheless, there still remains on the 
right of (62.3) only the term with |v’)=|1): since the matrix elements are zero 
except for transitions in which the number of photons with some one energy 
changes by 1, they are certainly zero for transitions between states containing one 
photon each, with different energies. 
Thus we have for a, .(t) the equation 


“ = (w2|V|1) eto Eg, 


= (@2|V|1) exp{i(w — w.)t —3P st}, (62.7) 


where w12 = E, — E>. Integration, with the condition a,,(0) = 0, gives 


— exp{i(w — w12)t — 41 \t} 


1 
duo = (w2]VI1) O— W2+3iT, 


(62.8) 


Hence the probability dw (62.5) is 


= 2 dw 
dw = \(w2| V1) (© — op) +e 
Since the width [,<w, we can put w =, in the factor |(w2|V|1)/’. Then the 
quantity 27|{w2|V|1)/’ is the ordinary probability (per unit time) for the emission of 
a photon with frequency w),. and other properties besides the frequency, such as 
the direction of motion and the polarization, whose existence has so far been 
ignored in order to simplify the notation. The dependence of the probability on 
these characteristics is entirely determined by the factor |(w2|V|1)|’. Thus the 
allowance for the level width does not affect the polarization properties or the 
angular distribution of the radiation. 
The sum 


Pi. = 27 DY (w2|V|1)/, (62.9) 


taken over the polarizations and directions of motion of the photon, is the usual 
total probability of emission. It is also the part of the width of the level E, (the 
partial width) which ts due to the transition 1-2, as distinct from the total width 
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I,, which is made up of contributions from all possible modes of “‘decay”’ of the 
quasi-stationary state considered.f 

By a similar summation of the probability dw, we obtain the following final 
formula for the frequency distribution of the emitted radiation: 


Ty dw 


O0 = Oe (on oF TP 


(62.10) 


where w, = I',.,2/T, is the total relative probability of the transition 1-2. This is a 
dispersion-type distribution. The shape of the spectral line that is given by formula 
(62.10) is that which occurs for an isolated atom at rest, and is called the natural 
shape.¢ 

Now let the level E, of the atom be also an excited level with a finite width I. 
We shall assume for simplicity that this width is due to transitions of the atom to 
the ground state E, with the emission of one photon; the final result (62.12) will not 
depend on this assumption. The decay of state 1 can then be regarded as an 
emission of two photons, discussed in §59. The matrix element for this process (not 
yet taking account of the finite lifetime of state 2) is 


(cow’0| V1) = S22 a TaN al (62.11) 


a pee , 


the state 2 in (59.2) becomes state 0, and in the sum over n the only remaining term 
corresponding to the atom in state 2 is the one which is large by resonance when w’ 
is close to E,— Epo. If we now take account of the finite lifetime of state 2, this 
simply changes E, into E,)—4iI, in (62.11), giving 


ier) wo'0| Vi w2){(w2! V}1) 
(ww'0) V1) = feo eon fi 


0) '+5i1 > 


Substituting this value of the matrix element in the equation for a,,.2(t) (which 
differs from (62.7) only as regards notation), we obtain by a derivation exactly 
similar to that of (62.8) 


ieee (ow '0 V|w2){w2) V1) —_ 


(w’ — Wa + 31D 2)(w + w' — 19 +311 )) 


The probability of emission of the photons w and w’ is 


W = |dowo(®)|? dw dw’ 


— Lig Tro dw dw’ 


2a 2m [(w' — wo)? + I 3][(@ + w’ — wy) +43] ean!) 

+ Formulae (62.6) and (62.9) can, of course, also be obtained by solving the equation for ai(t) 
analogous to (62.7). 

We may note that transitions to states of the continuous spectrum, causing a finite level width, do not 
necessarily involve the emission of photons. Highly excited (X-ray) levels can decay with emission of an 
electron and formation of a positive ion in the ground state (the Auger effect). 

t As distinct from the broadening caused by the interaction of the atom with other atoms (collision 
broadening) or by the presence of atoms in the source which move with various velocities (Doppler 
broadening). 
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This expression has sharp peaks at w’ ~ w and at w ~ wp, as it should. 

The shape of the spectral line corresponding to the transition 1-2 is obtained 
by integrating (62.12) with respect to w’; the range of integration can extend from 
—« to +. The integral is most simply calculated by closing the contour of 
integration with an infinite semicircle in the upper half of the complex w'-plane, and 
is given by the sum of the residues of the integrand at the poles 


wo! = 09 +3iTo, wo = wy—w til). 
The result is 
_ | = Tr; dw 
dw = w; i (=o oe =) (62.13) 


where w, =[).I30/T' TP. is the total probability of the double transition 1>2—-0.f 
The line shape (62.13) differs from (62.10) only in that I’, is replaced by I, +T,: 
the line width is equal to the sum of the widths of the initial and final states. 
The line width is not, in general, equal to the probability I'|. of the transition 
1—2 itself, i.e. is not proportional to the line intensity as in the classical theory. 
Since [, + I, >T,.2, the line can have a large width with a relatively small intensity. 


§63. Resonance fluorescence 


The allowance for the finite width of the levels in problems of radiation 
scattering is important when the frequency w of the incident radiation is close to 
one of the “intermediate” frequencies @,; OF w2,; this is called resonance fluores- 
cence (V. Weisskopf, 1931). 

Let us consider Rayleigh scattering by a system (an atom, say) in the ground 
state, so that the initial and final levels are the same and are strictly discrete. Let 
the frequency of the radiation be close to a certain frequency w,;, where the level n 
is an excited level and is therefore quasi-discrete. 

This problem could be solved by the method shown in 862, but there is no need 
to do so, since it is exactly analogous to the problem of non-relativistic resonance 
scattering at a quasi-discrete level (QM, 8134). According to the results derived 
there, the scattering amplitude must contain a pole factor 


1 
@ — (En —51L, = E,) 


When |w — »,,|>T,, on the other hand, the result must tend to the non-resonance 
formula (59.5). It is therefore clear that the required scattering cross-section is 
obtained by simply replacing E, by E, —3iT', in (59.5); the sum over n can be 


+ In more complex cases, w; is the total probability of all cascades which begin with the transition 
1-2 and finish at the level 0. 
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restricted to the resonance terms: 


2 
S (don e*)(dar 5) 
= M. 


ae 
(@;-oytW2 


do * do’. (63.1) 


The summation is over all states (having different angular-momentum components 
M,,) corresponding to the resonance level E,; the states 1 and 2 belong to the same 
level (the ground level), but may differ in the values M, and M). 

The cross-section (63.1) has its maximum value when w = w, , and this value is, 
in order of magnitude, ona, ~ w*d*/[;%. Since the probability of the spontaneous 
transition n > 1, and hence the width T,, ~ wd’, this value is 


Omax ~ 1!w? ~ dr’, (63.2) 


of the order of the square of the wavelength and independent of the fine structure 
constant, as compared with typical values ~r2 outside the resonance region. 

It must be emphasized that, since the atom is at a strictly discrete level (the 
ground level) before and after the scattering, the frequencies of the primary and 
secondary photons are exactly the same. If the incident radiation is mono- 
chromatic, the scattered line will therefore be monochromatic also. If the incident 
radiation has a spectral intensity distribution I(@) which varies only slightly over 
the width I, the intensity of scattered radiation will be proportional to 


I(@n1) dw 
(@ — Oni +402 ore) 
Thus the shape of the scattered line will be the same as the natural shape for 
spontaneous emission from the level E,. 

The cross-section (63.1) corresponds to the scattering tensor 


>, (didan(dk)n1 
(Ci )o1 = fh, TT" (63.4) 


Wni— @ Flin 


In particular, the polarizability tensor is 


>»; (di) in€dk ni 
ik = (Ck) = eames (63.5) 


Wni W—Ill nn 


It can be seen immediately that the addition of an imaginary part to the energy 
levels of the intermediate excited states makes the polarizability tensor no longer 
Hermitian, even at frequencies below the ionization threshold. It contains an 
imaginary part which is directly related to the absorption of radiation. 

After absorbing a photon, the atom will sooner or later return to the ground 
state, emitting one or more photons. The absorption cross-section, viewed in this way, 
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is just the total cross-section o; for all possible scattering processes.t On the other 
hand, according to the optical theorem (59.25), the cross-section can be expressed in 
terms of the anti-Hermitian part of the polarizability tensor. 

Substituting in (59.25) the tensor a, from (63.5), we find the following formula 
for the cross-section for absorption of a photon with frequency w close to wn: 


1 
_ aT n 
= 477° 2 Ida’ el’ 0) allo re, ty (63.6) 


In the limit as [,, > 0, the last factor tends to the delta function 6(@ — wn), in 
accordance with the fact that in this case only a photon having one particular 
frequency can be absorbed. Let radiation with a spectral and angular energy flux 
density I. (cf. (44.7)) be incident on the atom. Then the flux density of number of 
photons is (Ixe/w) dw do, and the probability of absorption is 

dW = OaUe/w) dw do. (63.7) 


If the function I,,(w) varies only slightly over the width I,, then we have after the 
integration over frequencies 


dW, a 4m? > ldat ; e|*Tie(@n1) do. 
M,, 
According to (45.5), 
w° 2 
dws, = De 2 Idi : e*| do 
3 
ale. Ae 


is the probability of spontaneous emission of a photon having the frequency oy; 
thus we return to formula (44.9). 


+ This discussion, it must be emphasized, refers to absorption by a system in its stable ground state. 
The problem would have to be stated differently for an excited state, because of the finite duration of 
the experiment. 


CHAPTER VII 


THE SCATTERING MATRIX 


§64. The scattering amplitude 


THE general problem concerning collisions is to find, for a given initial state of the 
system (an assembly of free particles), the probabilities of various possible final 
states (other assemblies of free particles). If |i) denotes the initial state, the result 
of the collision can be represented as the superposition 


2 LF <FISIi), (64.1) 


in which the summation is taken over the various possible final states [f). The 
coefficients in this expansion (f|S|i) (or, more concisely, S;), form the scattering 
matrix or S-matrix. The squares |S;|’ give the probabilities of transitions to 
particular states |f). 

If there were no interaction between the particles, the state of the system would be 
unchanged, corresponding to a unit S-matrix (absence of scattering). It is con- 
venient to separate this unit matrix in all cases, writing the scattering matrix in the 
form 


Si = 8 + (276 OP; — Pi) Thi, (64.2) 


where T;; is another matrix. In the second term we have written separately the 
four-dimensional delta function which expresses the law of conservation of the 
4-momentum (P; and P; being the sums of the 4-momenta of all the particles in the 
initial and final states); the other factors are included for subsequent convenience. 
In the non-diagonal matrix elements, the first term in (64.2) does not appear, and so, 
for the transition i—f, the elements of the matrices S and T are related by 


Syi = i(27)*6(P; — P;)T 4. (64.3) 
The matrix elements T;; which remain after separation of the delta function will be 
called the scattering amplitudes. 


When the moduli |S;;| are squared, the square of the delta function appears, and 
is to be interpreted as follows. The delta function comes from the integral 


8(P; — Pj) = aay | el(Pr-POe qty, (64.4) 


If another such integral is calculated with P,; = P; (since one delta function is 


247 
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already present), and if the integration is taken over some large but finite volume V 
and a time interval t, the result is Vt/(27r)*.t Thus we can write 


[Syl = (277)*5(P; ve P;)|T)i|’ Vt. 
Dividing by t, we obtain the transition probability per unit time: 
Wrei = (277)°5(P, = P;)|T;|°V. (64.5) 


Each of the free particles, initial and final, is described by its own wave 
function—a plane wave having some amplitude u (a bispinor for an electron, a 
4-vector for a photon, and so on). The structure of the scattering amplitude Ty; is of 
the form 


T;, = ufus...Quyu2..., (64.6) 


where on the left we have the amplitudes of wave functions of final particles, and 
on the right those of initial particles; Q is some matrix relating to the indices of the 
wave amplitude components of all the particles. 

The most important cases are those where the initial state comprises only one 
or two particles. Then we have respectively the decay of one particle or the 
collision of two particles. 

Let us first consider the decay of a particle into any number of other particles 
having momenta pj, in an element II d°p, of momentum space; the suffix a labels 
the particles in the final state, so that = p, = P;. The number of states in this element 
and in the normalization volume? V is 


I] V d*pi/Qr)°. 


The expression (64.5) must be multiplied by this quantity: 


Joao) 
dw = (277)*5O(P; oF P;)|T;i lV I] Ont 


(64.7) 

The wave functions used in calculating the matrix element must be normalized 
to “one particle in the volume V”’. For an electron, e.g., the wave function is the 
plane wave (23.1); for a particle with spin one it is (14.12); for a photon it is (4.3). 
All these functions include the factor 1/V(2eV), where « is the energy of the 
particle. Henceforth, however, it will be convenient to omit such factors in the 
wave functions, and include them in the expression for the probability. Thus the 


+ This can be shown in a different way by first calculating the integral over each coordinate in (64.4) 
for a finite range and then making the limits tend to infinity by means of QM, (42.4): 


sin’ ag 


lim ——3> = 78(a). 
00 &a 


+t For greater clarity, in the calculations in this section, we shall not take V to be unity. 
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electron plane wave will be 
w=ue ”, iu = 2m, (64.8) 
and the photon wave 
A=V(4r)ee™, ee*=-—|1, ek=0. (64.9) 


The scattering amplitude calculated with these functions will be denoted by M;; 
to distinguish it from Tj. Evidently 


M;, 
(== re ; , 
Ty (Q6eV...2e\V...)! (64.10) 


the denominator contains one factor V(2«V) for each initial or final particle. 
In particular, the decay probability 1s, instead of (64.7), 


1 d°p} 
— 4 9 (4) — Pp. 4 ee ~~ es 
dw = (2n)'8(P; — P)MaP 5-1] Gace (64.11) 


where «e is the energy of the decaying particle; as we should expect, the nor- 
malization volume does not appear in this formula.t 

Formula (64.11) can be given a more definite form by eliminating the delta 
functions, if the decay produces two particles (with momenta pj, p; and energies e1, 
€5). In the rest frame of the decaying particle pj = —p;=p’, e+ «=m. We have 
> — 


_ if 3 3.,/ 
dw = yp |Mn = rare —; 5(pi + p2)d(e1 + €2 - m) d°p; d’p3. 


The first delta function is eliminated by integration over d°p}; the differential d’*p}{ 
is written as 


dp’ = p” d|p’| do 


Se7 dence 2) 
£1 it £4 


= |p'| do (64.12) 


The validity of this is easily seen by noting that ¢/?-—m/?=e/—m¥=p”. The 
integration over ¢; + €; eliminates the second delta function, and the result is 


dw = ao s5n7c2|Mrllp'| do’. (64.13) 


Let us now consider a collision of two particles (having momenta p, and p» and 


+ If the final particles include N which are identical, a factor 1/N! must be inserted when integrating 
over their momenta to obtain the total probability; this factor takes into account the identity of states 
which differ only by an interchange of the particles. 


250 The Scattering Matrix $64 


energies ¢; and e2), in which they are transformed into any number of particles 
having momenta pj. Instead of (64.11) we now have 


1 dp} 
= 49(4) _p. [2 a 
dw = (27)*6(P; — Pi)|Mg| fae Llane 


The quantity that is of interest in this case is, however, not the probability but 
the cross-section do. The cross-section invariant under the Lorentz transfor- 
mations is obtained from dw on dividing by 

j=1/Ve je, (64.14) 
where I denotes the 4-scalar 


I= VE(pip2) = mim3]; (64.15) 


see Fields, §12.+ In the centre-of-mass system (p; = — p2=p) 


I= lpl(e, a5 £2), (64.16) 
so that 
. — [pl (+ +) _ vito, 
J= V ae a Vv (64.17) 


which is the same as the usual definition of the flux density of colliding particles, v, 
and v2 being their velocities. Thus the cross-section is 


1 d°p! 
= 4 (4) — Pp. R Poe a 
do = (277)°8‘(P; — P;)|Mji| ar |, On)2e! Det (64.18) 


This formula can be put into its final form by eliminating the delta function for 
the case where in the final state also there are only two particles. Let us consider 
the process in the centre-of-mass system, and let ¢ = €;+ €2= ¢;+ 3 be the total 
energy; pi = —p2=p and pj =—p:=p’ be the initial and final momenta. The delta 
function is eliminated in the same way as in the derivation of (64.13), and the result 


+For future reference, another form of I is 
TP’ = 4[s — (mi t+ m2)"][s — (mi — m2)’], (64. 15a) 


where s = (pi + p2)’. 
+t In an arbitrary frame of reference, 


j= x V (vi — v2)" — (v1 X v2)’. 


This expression is the same as the ordinary flux density whenever y; is parallel to v2; then j = |v: — v2|/V. 
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= I [2 Pp’ fs 
do = |My a do’; (64.19) 


in the particular case of elastic scattering, where the nature of the particles is 
unchanged in the collision, |p’| = |p]. 
This formula can be written in yet another form by using the invariant quantity 


t =(pi— piy = mi +m — 2(pipi) 
=mjt+m{—2e,e{+ 2\pi|lpi| cos 8, (64.20) 
where 6 is the angle between p, and pj). In the centre-of-mass system the momenta 


[pi] = |p| and |pi|=|p’| are determined only by the total energy «, and when e is 
given we have 


dt = 2|p||p’| d cos @. (64.21) 


Hence, in (64.19), 


P dd d(-t) 
do'= —ddd cos 6 =~, 
Y 2Ipllp' 


where @ is the azimuth of pj relative to p,.t Thus 


a! ing 
a0 647 Mj 


ae (64,22) 
where I is again the invariant (64.16). The azimuth @¢, and therefore the cross- 
section in the form (64.22), are invariant under those Lorentz transformations 
which do not change the direction of relative motion of the particles. If the 
cross-section is independent of the azimuth, formula (64.22) takes the particularly 
simple form 


_! jy, pat 
do = = |Myil’ Fr. (64.23) 


If one of the colliding particles is sufficiently heavy (and its state is unaltered by 
the collision), it acts only as a fixed source of a constant field in which the other particle 
is scattered. Since the energy (though not the momentum) of the system is conserved 
in a constant field, in this treatment of the collision process we can write the S-matrix 
elements in the form 


Sj: = i - 208(E; — E/) Tp. (64.24) 


+ Since the correct sign of the differential in such cases is obvious, we shall henceforward write 
simply dt for d(—t), and so on. 
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In the expression |S,|’, the square of the one-dimensional delta function must be 


interpreted as 
l 
[SCEs — EF) >5— 6(E; — Bit. 


Now, as in the derivation of (64.11), we change to the amplitude M,; instead of Tj, 
and obtain the following expression for the probability of a process in which one 
particle is scattered in a constant field and produces in the final state a certain 
number of other particles: 


1 d*p ! 

= —_ | ence — 
dw = 278(E; — €)|Myi’ 5; I] On) er 
Here « (= E;) is again the energy of the initial particle, pz and e, the momenta and 
energies of the final particles. The scattering cross-section is found by dividing dw 
by the flux density j = v/V, where v =|p|/e is the velocity of the particle that 
undergoes scattering. The normalization volume again disappears, and the result ts 


d*pi 


On) er (64.25) 


1 
d =72 6(E; — M,,|* =— 
oO 75 (FE; e)| AP apy LI 


In the particular case of elastic scattering, there is only one particle in the final 
state, with the same energy and the same momentum (in absolute value). Writing 


d’p'>p’d|p'| do' = lp'le' de’ do' 


and eliminating 6(e’— ¢) by integrating with respect to e’, we find the cross-section 
in the form 


I 
do = AEE |M,i|° do’. (64.26) 


Finally, if the external field is time-dependent, such as the field of a system of 
particles executing a given motion, the S-matrix also lacks the delta function of 
energy. Then S,;;=iT;; and, after the change from T;; to M,; by (64.10), the 
probability of (e.g.) a process in which the field creates a given set of particles is 


d°p| 
dw =|M;\?T] AGE (64.27) 


a 


$65. Reactions involving polarized particles 


In this section we shall show by means of simple examples how the state of 
polarization of the particles concerned in the reaction is taken into account when 
calculating the scattering cross-section. 
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Let there be one electron in the initial state and one in the final state. Then the 
form of the scattering amplitude is 


My = U' Au (= UA), (65.1) 


where u and uw’ are the bispinor amplitudes of the initial and final electrons, and A 
is some matrix, which depends on the momenta and polarizations of the other 
particles (if any) which take part in the reaction. 


The scattering cross-section is proportional to |M,;|°, and 


!? 
(i'Au)* = u'y*A*u* 

= u*Aty*u' 

= Au’, (65.2) 
wheret 

A=yA‘y’. 
Thus 
|M,i\° = (@'Au)(iAu’') 


= UU, Aum A mi- (65.3) 


If the initial electron is in a mixed (partially polarized) state with density matrix 
p, and if we wish to find the cross-section for a process in which the final electron 
is in a specified polarization state p’, the products of the bispinor amplitude 
components must be changed as follows: ujiii— pix, Um > Pim. Then 


|M,;/° = tr (p’ApA). (65.4) 
The density matrices are given by formula (29.13): 
p =2x(yp + m)(1— y*(ya)) (65.5) 


and similarly for p’. 
If the initial electron is unpolarized, then 


p =2(yp +m). (65.6) 

Substituting this expression is equivalent to averaging over the polarizations of the 
electron. If it is desired to determine the cross-section for scattering with any 
+ Since the matrix A has to be constructed, we shall note here, for future reference, the following 


easily verified ¢quations: 


He tas, eae eee P — AP va, 
vas Vegas ee (65.2a) 
ye-y vy ayy 
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polarization of the final electron, we must also put p’ = (yp'+m), and double the 
result; this operation is equivalent to summation over the polarizations of the 
electron. Thus we have 


+ > [Mg =3 tr {yp’ + m)A(yp + m)A}, (65.7) 


polar. 


where the sum is taken over initial and final polarizations, and the factor + converts 
one summation into an averaging. 

The density matrix p’ in (65.4) is a secondary quantity which essentially 
represents the properties of the detector as selecting one or the other polarization 
of the final electron, not the properties of the scattering process as such. There is 
the question of the polarization state of the electron resulting from the scattering 
process itself. If p” is the density matrix of this state, then the probability of 
detecting an electron in the state p’ is obtained by projecting p” on p’, i.e. by taking 
the trace tr (p“” p’). This will be proportional to the corresponding cross-section, i.e. to 
|M,|’. A comparison with (65.4) shows that 


p)~ ApA. (65.8) 


Since we know that p“’ must have the form (65.5) with some 4-vector a“), we need 
only determine the latter. This could be done by means of formula (29.14), but it is 
even simpler to proceed as follows. 

We have seen in §29 that the components of the 4-vector a can be expressed in 
terms of those of the 3-vector € which ts (twice) the mean value of the electron spin 
in its rest frame. The polarization states of the electrons are entirely determined by 
these vectors, and it is convenient to express the scattering cross-section also in 
terms of them. The square |M,j|* will clearly be linear in each of the vectors ¢ and ¢' 
which relate to the initial and final electrons, and its form as a function of C’ will be 


Mi =a+B-C, (65.9) 
where a and £ are themselves linear functions of €. 
The vector C in (65.9) is the particular polarization of the final electron that is 


selected by the detector. The vector ¢, corresponding to the density matrix p", is 
easily found as follows. According to the above argument, 


|Myi!’ ~ tr (p'p%). 


Since this quantity is relativistically invariant, it may be calculated in any frame of 
reference. In the rest frame of the final electron we have, by (29.20), 


p'p?~(l+a-O)1+o0: 0%). 
Hence 


[Mnf ~ 1 +o +o”, 
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and from a comparison with (65.9) 
C= Bla. (65.10) 


Thus the calculation of the cross-section as a function of the parameter C also 
gives the polarization (. 

In more complex cases, when there is more than one initial or final electron, the 
calculations are similar to the foregoing. 

For instance, if there are two electrons both initially and finally, the form of the 
scattering amplitude is 


Mg = (GA) (3 Buy) + (42Cu,)(4, Duy), 


where u;, U2 are the bispinor amplitudes of the initial electrons, and uj, u3 those of 
the final electrons. The square |M,j|’ includes terms of the forms 


|@}AusP|a5Bu,f? and (a}Au,)(asBu2)(aCu,)*(a{Dur)*. 


The former reduce to products of two traces like (65.4); the latter reduce to traces 
having the form 


tr (pi Ap; Cp3Bp2D). 


Positrons are described by amplitudes with “negative frequency” u(—p). For 
reactions involving positrons, the only difference from the preceding analysis is 
that the expressions to be used for the density matrices differ from (65.5), (65.6) as 
regards the sign of m; cf. (29.16), (29.17). 

Let us now consider the polarization states of photons participating in the 
reaction. 

The polarization of each initial photon appears linearly in the scattering am- 
plitude in the form of a 4-vector e, and that of each final photon as e*. In each case 
the 4-tensor e,e* occurs in the cross-section (i.e. in the square |M,;|*). To obtain an 
arbitrary partially polarized state, this tensor must be replaced by the four- 
dimensional density matrix, the 4-tensor p,,: 


Ces > Pur (65.11) 
In particular, for an unpolarized photon, according to (8.15), 

Pu = — 28uv (65.12) 
Thus averaging over polarizations of the photon is equivalent to contracting in 


|M;i|" with respect to the corresponding two tensor indices p, v.t 
If summation over the photon polarizations is desired, not averaging, then we 


+ The expression (65.12) as it were reduces the averaging over the two actually possible polariza- 
tions of the photon to one over the four independent directions of the four-vector e. 


256 The Scattering Matrix §66 


must replace e,e* by a quantity twice as large: 
C205 Diy: (65.13) 


The density matrix of the polarized photon is given by formula (8.17). The 
choice of the 4-vectors e’, e® which appear in this expression is usually governed 
by the particular conditions of the problem. In some cases they may be related to 
certain spatial directions in a given frame of reference; in other cases, it is more 
convenient to relate them to the 4-vectors which characterize the problem, namely 
the 4-momenta of the particles. 

In (8.17) the polarization of the photon is described by the Stokes parameters, 
which form the ‘“‘vector” & = (&, &, &). As with the electron, it is necessary to 
distinguish the polarization €” of the final photon as such from the polarization €' 
that is selected by the detector. If the square of the scattering amplitude is known 
as a function of the parameter &’: 


IMy//=a+B-&, 


then the polarization &” = B/a, exactly as in (65.10). 


$66. Kinematic invariants 


Let us consider some kinematic relations for scattering processes in which 
there are only two particles, both in the initial state and in the final state. The 
relations in question are deduced from the general conservation laws alone, and are 
therefore valid for all particles and all laws of interaction. 

The law of conservation of 4-momentum, in a general form that does not specify 
which are the initial and which the final particles, is 


dit q2+ q3t+ qa=0. (66.1) 


Here +q, are the momentum 4-vectors; two of them pertain to the incident 
particles and two to the scattered particles, the momenta for the latter being —4q,. 
Thus for two of the q, the time component qi > 0, and for two q?<0. 

The law of charge conservation must be satisfied as well as that of 4-momentum 
conservation. Here the charge may be interpreted not only as the electric charge 
but as any other conserved quantity whose sign is opposite for particles and 
antiparticles. 

For given types of particles concerned in the process, the squares of the 
4-vectors q, are the squares of the particle masses, which are fixed (q2 = m}). 
Three different reactions occur, according to the values taken by the time com- 
ponents q? and the values of the charges. These reactions may be written 


(i) 14+2>3+4, 
(Il) 1+332+4, (66.2) 
(Il) 1+432+3. 
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Here the numbers refer to the particles, and the bar over a number denotes the 
corresponding antiparticle. The change from one reaction to another, i.e. the 
transfer of a particle to the opposite side of the formula, corresponds to a change in 
sign of the corresponding time component gi and in the sign of the charge (i.e. a 
replacement of the particle by its antiparticle). The reactions inverse to (66.2) are 
also possible, of course. 

The three processes (66.2) are referred to as three cross-channels of a single 
general reaction. 

The following are some examples. If particles 1 and 3 are electrons, and 2 and 4 
are photons, then channel I represents the scattering of a photon by an electron; 
channel III is the same as channel I, since the photon is strictly neutral. Channel IT 
is the conversion of an electron—positron pair into two photons. If all four particles 
are electrons, then channel I is the scattering of an electron by an electron, and 
channels II and III the scattering of a positron by an electron. If particles 1 and 3 
are electrons, and 2 and 4 are muons, then channel I is the scattering of e by py, 
channel III the scattering of e by m, and channel II the conversion of a pair eé into 
a pair pp. 

In the discussion of scattering processes, the invariant quantities which can be 
constructed from the 4-momenta are particularly important. The invariant scatter- 
ing amplitudes are functions of these quantities ($70). 

Two independent invariants can be constructed from four 4-momenta, since, 
according to (66.1), only three of the 4-vectors q, are independent. Let these be q:, 
q2, q3. From them, six invariants can be constructed: the three squares qi, q3, q3 and 
the three products qiq2, qiq3, q2q3. But the first three are the given squares of the 
masses, and the second three satisfy one relation which follows from the equationt 


(qit q2+ q3) = qi = mi. 


In order to increase the symmetry it is, however, convenient to consider not 
two but three invariants, which may be taken as 


s =(qit qo =(q3+ qa)’, 
t = (qi t+ qa)’ = (qo + qa)’, (66.3) 
u =(qit qa) =(Qr+ Qa). 


These are easily seen to be related by 
s+t+u=h, (66.4) 


where 


h=mitms+m3t+ mi. (66.5) 


+ In the general case of a reaction involving n (24) particles, the number of functionally in- 
dependent invariant quantities is 3n — 10. There are altogether 4n quantities, the components of the n 
4-momenta qa, between which there are n functional relations qi= ma, and four given by the con- 
servation law {qa = 0. Arbitrary values can be assigned to six quantities, in accordance with the number 
of parameters which define the general Lorentz transformation (a general four-dimensional rotation). 
The number of independent invariants is therefore 4n —n —4—6=3n — 10. 
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In the principal channel (I), the invariant s has a simple physical significance. It 
is the square of the total energy of the colliding particles (1 and 2) in their 
centre-of-mass system (for p;+ p.=0, s =(e,+ €2)’). In channel II, the invariant t 
has a similar significance, and in channel III the invariant u. The three channels are 
therefore often called s, t and u channels. 

It is easy to express each of the invariants s, t and u in terms of the energies 
and momenta of the colliding particles in each channel. Let us consider the s 
channel. In the centre-of-mass system of particles 1 and 2, the time and space 
components of the 4-vectors q, are 


di = Pi = (€1, Ps), G2 = P2 = (€2, —Ps), 
(66.6) 


q3 = —p3 = (—€3, —ps), qa = —pa= (— a, ps)5 


the suffix s in p, and p,; indicates that these momenta refer to the reaction in the s 
channel. Then 


So ==E Es =e€,+&=eE3t E45 (66.7) 


Asp; =[s —(m,+ m)’J[s — (m,— m2)’], 


Asp? =[s — (m3 + m,)"I[s — (m3 — ma]; 68:8) 


2t=h—s+4p, -pi- + (mt = mm} m’), 
\ (66.9) 
2u=h—s-—4p, “pi + — (my — m2)(m}3— mi). 


For elastic scattering (m;= m3, m,.= m4), we have |p,|=|p;|, and hence €,= «3, 
£, = €4. Instead of (66.9), the simpler formulae 


t = —(p, — ps)’ = —2p5(1 — cos 4,), 
: ‘ (66.10) 
u = —2p,(1 + cos 6,) + (€1 — €2) 


are then obtained, where @, is the angle between p, and p‘. The invariant —t is here 
the square of the (three-dimensional) momentum transfer in the collision. 

Similar formulae for the other channels are found by a straightforward change of 
notation. For the t channel we must interchange s and ft, and 2 and 3, in 
(66.6)—(66.10); for the u channel, we interchange s and u, and 2 and 4. 


867. Physical regions 


When considering the scattering amplitudes as functions of the independent 
variables s, t, u (which are related only by s +t +u =h), we encounter the need to 
distinguish regions in which their values are physically permissible from those in 
which they are not. Values which can correspond to a physical process of 
scattering must satisfy certain conditions which follow from the law of con- 
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servation of 4-momentum and the fact that the square of each of the 4-vectors q, is 
a given quantity m2. 
The product of two 4-momenta 
PaPo = Mam. (67.1) 


Hence 
(da + qv)’ = (Pa + Po)” = (ma + ms)’, 
If Ga = Das Wo = Po (OF Ga = — Pas Go = — Po); OF 
(da + qo) = (Pa — Po)’ S(m, — ms)’, 
if da = Das dp = ~Py. Hence, for a reaction in the s channel, 
(m,+ m2) <s =(m3+ m4)’, 
(m,— m3) >t <(m.— my’, (67.2) 
(m,— m4)’ =u <(m.— my, 


and similarly in the t and u channels. 
To determine the remaining conditions, we form a 4-vector L which is dual to 
the product of any three of the 4-vectors q,, say 


Ly = Cyurp 19245. (67.3) 


In the rest frame of particle 1, say, we have q; = (qj, 0). Then L has only the spatial 
components L; = eoqiq5q3. Thus L is a space-like vector, and L? <0 in every frame 
of reference. Expanding L’, we obtain the condition 
qi 4q 12 413 
Q291 43 qoq3| 20. (67.4) 


341 342 qi 


This can be expressed in terms of the invariants s, t, u in a form which 1s the same 
for all channels: 


stu=as+bt+cu, (67.5) 
where 
ah = (mim3— m3mi)(mj + m3— m3— m4), 


bh = (mim3— m3m4)(mj + m3— m5— m4), (67.6) 


ch = (mim7— m3m3)(mj + mi— m5 — m3) 


(T. W. B. Kibble, 1960). 
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For a graphical representation of the regions of variation of s, t and u, it is 
convenient to use triangular coordinates in a plane, called the Mandelstam plane 
(S. Mandelstam, 1958). The coordinate axes are three straight lines which intersect 
to form an equilateral triangle. The coordinates s, t, u are measured along 
directions perpendicular to these three lines; the directions towards the interior of 
the triangle are reckoned positive, as shown by the arrows in Fig. 5. Thus each 
point in the plane has corresponding values of s, t and u which are represented 
(with the appropriate signs) by the lengths of the perpendiculars to the three axes. The 
condition s+t+u=h is satisfied on account of a known theorem of geometry, h 
being equal to the altitude of the triangle. 

Let us consider the important case where the principal channel (s) corresponds 
to elastic scattering. Then the masses of the particles are equal in pairs: 


m,=m;=™m, M, = M4 = wy. (67.7) 
Let m > mw. The condition (67.5) has 
h =2(m’+ p’), a=c=0, b =(m’*— p’y, 
so that 
sut =(m?— p’)t. (67.8) 


The boundary of the region defined by this inequality comprises the straight line 
t = 0 and the hyperbola 


su = (m?— py, (67.9) 


whose two branches lie in the sectors u <0, s <0 and s >0, u >0; the axes s =0 


Fic. 5. 


+ For example, if the point P in Fig. 5 is joined to the three vertices A, B, C of the triangle, the 
latter is divided into three triangles with altitudes s, t and u; equating the sum of their areas to that of 
the triangle ABC, we obtain the required relation. The proof is similar when P lies outside the triangle 
ABC. 
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and u = 0 are the asymptotes of the hyperbola. Instead of (67.8) we can write 
t >0, su >(m?— p’ 

or 
t<0, su<(m*— p’)’. 


Moreover, according to the conditions (67.2) we must apply the inequality s > 
(m+ p) in the s channel and u>(m+ py) in the u channel; the remaining 
inequalities are then necessarily satisfied. We thus find that channels I, II, HI (s, t, 
u) correspond to the shaded regions in Fig. 6, which are called physical regions. 

If « =O (particles 2 and 4 are photons), the lower branch of the hyperbola 
touches the- axis t = 0, and the physical regions are as shown in Fig. 7. 

If m = p, the boundaries of the region (67.8) degenerate to the coordinate axes, 
and the physical regions are the three sectors shown in Fig. 8. 

In the general case of four different masses, the equation 


stu=as+bt+cu (67.10) 


defines a third-order curve whose branches are the boundaries of the physical regions 


u=(m tu)? S= (m+ {4 


s=(m-—p)* u=(m-p)? 
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u=Q s=0 


FIG. 8. 
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LY. 
ny 


FIG. 9. 


of the three channels, as shown in Fig. 9. Let m; =m,.2=m;=2my,. Then a=b2c, 
a>0Q, b>0O. The curve (67.10) meets the coordinate axes at points on the line 
as + bt + cu = 0 (see the broken lines in Fig. 9). This line is as shown in Fig. 9a and 9b, 
depending on the sign of c. If c <0, the physical region of the u channel includes part 
of the area of the coordinate triangle. In this case, therefore, the quantities s, t and u 
may all be positive at the same time. All three branches of the boundary curve have the 
appropriate coordinate axes as asymptotes; this may be seen by eliminating one of the 
variables from (67.10) by means of the relation s + t + u = h, and then making one of 
the other variables tend to infinity. In general, the conditions (67.2) yield nothing in 
addition to the limits defined by equation (67.10). The straight lines which correspond 
to the equality signs in (67.2) do not intersect the physical regions shown by the shaded 
areas in Fig. 9; some of them touch the boundaries of these regions, corresponding to 
extreme values of the variable s, t or u in the corresponding channel. 

When the mass of one of the particles exceeds the sum of the masses of the other 
three (m, > m,+m3+ m,), a fourth reaction channel is possible, corresponding to the 
disintegration 


(IV) 132+3+4. (67.11) 
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For this channel, in the rest frame of the disintegrating particle, 


q1 = (m,, 0), q2 = (—€, —p»), 3 = (—€3, —ps), 
ds = (— £4, —ps), Eg tez;te&4= mM, P2 + p3 + py = 0. 


The invariants are 


| 
3 
+ 
ea 
| 
ho 
= 
. 


S 
t=mit+m35—2mE3, (67.12) 


u=mi+mi—2myeés. 
We then have from (67.1) 


(m3 + m4)’ <s <(m,;—-—m)’, 
(m)+ my <t <(m,— my’, (67.13) 
(m, + m3) <u <(m,— mi)’. 


Thus all three invariants are positive, and the physical region of the disintegration 
channel is within the coordinate triangle. 


PROBLEMS 


PROBLEM |. Find the physical regions for the case of three equal masses: m; = m, m2 = ma= ma=u 
(for example, the reaction K+ 7—>7+ 7). 


SOLUTION. Equation (67.10) becomes 
stu = "(m*— py’, (1) 
with 
stttu=3p +m’. 


Regions I, IT and II] are bounded by curves of the same shape, with s > 0,t <0, u <0 for region I, and so on. 
If m > 3, equation (1) also has a branch in the form of a closed curve with s > 0, t > 0, u > 0, which bounds 
the region of channel IV (Fig. 10). 
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PROBLEM 2. The same as Problem 1, but for the case m; = m, m2 = p, m3 = m4=0,m > p (forexample, 
the reaction w+v->e+ yp). 


SOLUTION. The condition (67.5) becomes 
stu >m’p’s, 


with s+t+u=m’+ yp’. The physical regions are bounded by the axis s = 0 and the two branches of the 
hyperbola tu = m7’ (Fig. 11). 


PROBLEM 3. The same as Problem 1, but for the case m1 = m3=m, m2= 0, m4= yp, with m > 2p (for 
example, the reaction p+ y>p +7’). 


SOLUTION. The boundary equation (67.10) becomes 


stu=a(s +u)+ Dt, 
ah=m*p*, bh=m‘Qm’-p’), h=2m? +p’. 


Elimination of u gives 
t? + (C445 -h)1+ 2-0 
S S 


For a given value of s, this is a quadratic in t. If s > (m+ pw) (the region of the s channel), there are two 
negative values of t for each value of s. If s =(m+ p)’, these two roots of the quadratic coincide at 
t =—mp’/(m + p). The boundary of the s channel region is then as shown in Fig. 12. The lower branch of the 
boundary tends asymptotically to the axis u =0, and the upper branch crosses this axis at the point 
t= p(w? —m’). 

The u channel region is symmetrical with the s channel region; the t channel region is situated as shown 
in Fig. 12. 


§ 68. Expansion in partial amplitudes 
An important step in the analysis of a reaction of the form 
at+tboc+t+d (68.1) 


is the expansion of the scattering amplitude in partial amplitudes, each of which 
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Fic. 12. 


corresponds (for a given total energy «) to a definite value of the total angular 
momentum of the particles J in their centre-of-mass system.f 

These partial amplitudes are, therefore, elements of the S-matrix in the angular 
momentum representation: 


(eJ'M'|S|eJM). 


Since the angular momentum J and its component M along a specified z-axis are 
conserved, the S-matrix is diagonal with respect to these numbers (and also with 
respect to the energy <). Because of the isotropy of space, the diagonal elements are 
independent of the value of M. For given J, M and «, the scattering matrix is still a 
matrix with respect to the spin quantum numbers; the elements of this matrix will be 
written in a more concise form: 


(eJMAX'|S|eJMA) = (A'|S7(e)IA), (68.2) 


where A and X’ are the sets of spin quantum numbers. These can most naturally be 
taken to be the helicities of the particles. The helicity, unlike the spin component along 
an arbitrary axis in Space, is conserved for a free particle, and it commutes with both 
the momentum and the angular momentum of the particle ($16). The helicities may 
therefore be used in both the momentum and the angular momentum representation of 
the scattering matrix. 

The elements of the S-matrix with respect to the helicity indices will be called the 
helicity scattering amplitudes, and X, A' will be taken to include the helicities of the 
initial and final particles respectively: 


A= (Xa; Xp), x! = (Ac, Xa). 
In the momentum representation, the scattering matrix elements are defined with 
respect to the states |enA) (where n = p/|p| is the direction of the momentum of relative 


motion in the centre-of-mass system); in the angular momentum representation, they 


+ Most of the results in §§68 and 69 are due to M. Jacob and G. C. Wick (1959). 
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are defined with respect to the states |eJMd). They can be related by means of the 
expansions 


\JMn) = i Ind)(MALTMA) dog, (68.3) 


where the integration is over the directions n; the energy « is, for brevity, omitted from 
the state symbols. Since this transformation is unitary (see QM, § 12), the coefficients 
of the inverse transformation are 


(JMA|nA) = (nv|JMA)*, (68.4) 


By the general rule of matrix transformation, the same coefficients give the relation 
between the S-matrix elements in the two representations: 


(n’d'|Sina) = S Un’a\IMX'XIMA'|S|IMA)(IMA [nd ). (68.5) 
J,M 


The coefficients in the expansion (68.3) are easily found by means of the results of 
$16. Let the wave functions of all states be expressed in the momentum represen- 
tation, 1.e. as functions of the direction of the momentum (for a given energy); this 
direction, as an independent variable, will be denoted by v to distinguish it from the 
direction nas a quantum number of the state. In this representation, the wave function 
has the form (16.2): 


Wav) = US — n). (68.6) 
When (68.6) is substituted in the expansion (68.3), the latter reduces to a single term: 
Win = WA\TMA)u™. (68.7) 


The helicities A, and A, of the two particles are defined as the components of their 
spins in the directions of their respective momenta. If the momenta are p, =p, 
p» = —p, then these directions are n for the first particle and —n for the second particle. 
If now the system is regarded as a single particle with helicity A in the direction n, then 
A=A,—Ap. Its wave function (in the momentum representation) can be written, 
according to (16.4), in the form 


nao) = «DR (wy ft (68.8) 


Comparison of (68.7) and (68.8), with the variable v replaced by n, gives the required 
coefficients: 


(nA|JMA) = (a4! DY4(n). (68.9) 
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Substituting these coefficients in (68.5), we have 
(n’d'|S|na) = D 21 Dewy DEYany(r’|S!|a), 
fim 41 (68.10) 
A =Xa—Aps A’ =). — Na, 


with the abbreviated notation (68.2). If the direction n is taken as that of the z-axis, 
then 

DXi (n) = 0am, 
and (68.10) becomes 


(n’'A'|S{nd) = > a DYN’ KAS? |X). (68.11) 


J 


We see that the expansion in partial amplitudes has the functions DY as 
coefficients. For a reaction of the form (68.1), it is convenient to define the scattering 
amplitude f in such a way that the cross-section (in the centre-of-mass system) is 


do =\(n'd'If|ma)P do’; (68.12) 


by comparison with (64.19), we can relate this amplitude to the matrix element M,;. The 
expansion of the amplitude in partial amplitudes may be written 


(n’A'|flmd) = 3) 2S + I)Dire(n')DRM* nya 'If"|A), Oe) 


or, taking the z-axis in the direction n, 
(n’A'|f[nd) =) (2 + HDYAM’)\A'|f7 A). (68.14) 
J 


This is a generalization of the usual expansion in partial amplitudes for the scattering 
of spinless particles; see QM, (123.14). Since D}y’ = Pi(cos 6), (68.14) reduces in the 
case of zero spins to an expansion in Legendre polynomials 


f(0) => QL + lfiPL(cos 8). 


The cross-section (68.12) is valid when all the particles have definite helicities. If 
they are in mixed polarization states, the cross-section is found by averaging the 
product 


(AcAalf[Aadn XA CA alf[A cA )* 
over the polarization density matrices of the particles, 


(apa Aw PO IABMKA Up Ac KA ap |Aa)s 
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see the first footnote to 848. For example, in a reaction between unpolarized particles 
a.b to form unpolarized particles c, d, we have 


use / 
TapF Iw FH 2 py CF + DAS +1)x 


x (AcAalf7 |Aade KAcAaLf 7 |AaAn)*D VAC) DY A* (1); (68.15) 


do = 


the z-axis is along n, and the first summation is over Aq, Ap, Acs Ag. Using QM, (58.19) for 
the function D¥{* and then the expansion QM, (110.2), we have finally 


do A—A’ ! 
= —] 2J + 1)2J'+ 1) x 
do (2S, “te 1)(2s, I 1) ee ( ) ( M ) 


x (AcAalf? [Aads) (AcAalf 7 |Aadn)* X 


‘i men J’ 


J L 
XS (QL+1 ( oe )P @), 68.16 
> ( ) a 0 A aah 0 L(cos ) ( ) 


where @ is the angle between n’ and the z-axis; the summation with respect to L 1s over 
all integers which can occur when J and J’ are added vectorially. The expansion of the 
scattering amplitude in partial amplitudes gives a full expression of all properties of 
the angular distribution of scattering that are due to the symmetry with respect to 
Spatial rotations. But it does not explicitly reveal the properties that are due to the 
symmetry with respect to spatial inversion. The P invariance (if possessed by the 
interaction) leads to certain relations between the various helicity amplitudes (see 
$69). 


$69. Symmetry of helicity scattering amplitudes 


The conditions imposed by the symmetry with respect to the transformations P, T, 
C (if, of course, the particle interaction process in question in fact possesses such 
symmetry) lead to certain relations between the helicity scattering amplitudes, and 
therefore reduce the number of independent amplitudes.f 

To establish these relations, we shall first determine the symmetry properties of 
the helicity states of a system of two particles. 

Let us consider the particles in their centre-of-mass system. One particle has 
momentum p,; =p and helicity A, with respect to the direction of p; the other has 
momentum p2 = —p and helicity A. with respect to the direction of —p. If the helicity is 
defined with respect to the same direction, that of p, its values are A, and —A2, and the 
particles will thus be described by plane waves with amplitudes u@” and ut”. The 
two-particle system is described by a (multi-component) function u$”? formed from 
the products of the amplitudes u?” and ut”. 

Let us next regard the system as a single particle with helicity A = A, — A2 in the 
direction n=p/|p|; we can then write the wave function (in the momentum 


+ This number does not, of course, depend on the specific representation of the matrix S’,andis the same 
for any choice of the spin variables. 
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representation, i.e. as a function of n) for a state with definite values of J, M, A, A (and 
of the total energy «): 


denovna = uo"? Kn) AH, = Ase (69.1) 


cf. (68.8). Since A is the component of the total angular momentum in the direction of 
p, we must have 


|A| SJ. (69.2) 
According to (16.14), under inversion 


Pu?”?(n) = qrqu®??(—n) 
= Minl—1)see rey OM (py, (69.3) 


where 7; and 7 are the internal parities of the particles. Using also (16.10), we find the 
transformation law for the functions (69.1): 
Pin = minn(— 1) raya" (69.4) 
If the two particles are identical, the question arises of the symmetry with respect 
to their interchange. This interchange implies interchanging their momenta and their 
spins. To show the significance of this operation as applied to the function (69.1), we 
note that its definition contains an asymmetry, in that the angular momenta of the two 
particles are projected on the direction of the same vector p; = p, the momentum of the 
first particle. After the interchange, this vector is replaced by p.=~—p, and the 
components of the angular momenta j, and j, along this vector are —A, and A; (instead 
of 4; and — A, along p). The result of applying the particle interchange operator Py to 
the function (69.1) may therefore be written 


“ eA 23 4+1 
Prom, = uO? (—n) Din) nl 2 ) 


where again A = A;— A2. Then, using (69.3) and (16.10), we find that 


Pista, = (-1)°"Ymn,a,> (69.5) 


where §S,;= 5.9=S. 

For identical particles, the permissible states must be either symmetric (for 
bosons) or antisymmetric (for fermions) with respect to interchange. Since the former 
case occurs when the particle spin s is integral and the latter case when it is 
half-integral, in either case the permissible helicity states of the two-particle system 
can be written as linear combinations 


[1+ (-1)*Proldian ays 


QE4-S 
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or, according to (69.5), 
Waar, + (-1)’ bmn, a,- (69.6) 


It is noteworthy that this combination is the same for both bosons and fermions. 
For a particle—antiparticle system, the result of the interchange is expressed by the 

same formula (69.5), but, unlike the case of identical particles, states of either 

symmetry under interchange are here permissible, i.e. both combinations 


Ww = Wma, + (- 1) "dasa, (69.7) 


can occur. These states have certain charge parities C. The operation of charge 
conjugation may be regarded as the result of a total interchange of all variables (spin 
and charge) of the two particles, followed by reverse interchange of the spin variables 
(helicities). The result of the first operation must be the same as that of interchange ina 
system of two identical particles. Hence it is clear that, with the upper sign in (69.7) 
(which is the same as the sign in the state (69.6) permissible for identical particles), the 
system will be charge-even, and with the lower sign charge-odd: 


Cum = + y*. 


Finally, let us consider the operation of time reversal. The wave function of a 
particle at rest with spin s and component thereof o is transformed according to 


Tse = (-1)°'s-03 


see QM, (60.2). The wave function of two particles in their centre-of-mass system may 
also be regarded (in respect ot its transformation properties) as that of a “‘particle”’ at 
rest, with angular momentum J and component thereof M. The helicities \, A. are 
unchanged: time reversal changes the sign of the momentum and angular momentum 
vectors, and the products j: p are therefore unaffected. Hence 


Tap, = (— 1) daa, (69.8) 


We can now write down immediately the symmetry relations for the helicity 
amplitudes. 

If the interaction is P-invariant, then for the reaction a + b >c + d the amplitudes 
of the transitions 


[AgAp) > |AcAg) and PIAgA,») > PlAcAg) 


must be the same (for given J and e). Hence, using (69.4), we find 


(AcAa]S"|Aady) = ae 1) he als | As SA) (69.9) 


a 
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If states with definite parities, i.e. the combinations 


1 ; 
V2 (Womaary + Pdima,a,)s 


where Aj, Az = AgAp OF Ac, Ag, are Chosen instead of those with definite helicities, then 
the amplitudes of transitions in which parity is not conserved are zero. 

Time reversal transforms each state in accordance with (69.8), and also inter- 
changes initial and final states. Thus T invariance leads to the relations 


(AcAa]S7(e)|Aady) = (Aadv|S’(e)|AcAa). (69.10) 


These two amplitudes, however, pertain to different processes, the direct and 
reverse reactions. These two processes are essentially equivalent only in the case of 
elastic scattering, and (69.10) is then a relation between helicity amplitudes for the 
same reaction. 

In elastic scattering of two identical particles, the number of different amplitudes is 
further reduced because of the symmetry with respect to interchange. We have seen 
that, for a given J, the states which occur are either all symmetric or all antisymmetric 
in A,, Ax The conservation of angular momentum therefore implies that of the 
symmetry with respect to interchange of helicities. 

A similar situation occurs in the elastic scattering of a particle by its antiparticle, or 
the conversion of one particle-antiparticle pair into another, i.e. a reaction 
a+a—>b+b. For given J, there are both symmetric and antisymmetric states with 
regard to Ai, Az, but they correspond to different values of the charge parity of the 
system. Hence it follows that, if the interaction of the particles is C-invariant, so that 
the charge parity 1s conserved, transitions between states of different symmetry with 
regard to A;, Az are forbidden.t It must be emphasized, however, that there is a 
difference from the case of identical particles, in which states of one symmetry are 
entirely absent for any given J. In the “particle—antiparticle” case, only transitions 
between states of different symmetry are forbidden; the states themselves exist for 
every J. 

Because of the universal CPT invariance, the existence of T invariance implies 
that of CP invariance. The latter brings about the equality of amplitudes for two 
reactions, one obtained from the other by replacing all particles by antiparticles (and 
changing the sign of the helicities): 


(Acda|S” |Aado) = (AzAa|S” |Aadz), (69.11) 


where A; = —A, and so on.t 
The number of independent amplitudes is the same for all the cross-channels of 


+ A similar prohibition can also arise from isotopic invariance of the interaction of non-identical 
particles. For instance, transitions between states of different symmetry with regard to Aj, A2 are forbidden, 
to the extent that this invariance holds, in the scattering of a neutron by a proton. 

+t Since these two amplitudes relate to different reactions, interference between which is not possible, 
the phase factor in (69.11) would have no significance, and can be taken as unity. Only the equality of 
cross-sections which follows from (69.11) is actually meaningful. 
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one generalized reaction, and therefore this number can be determined from any 
channel. For example, the elastic scattering a+b—-a+b and the annihilation 
a+a—-b+b are described by the same number of independent amplitudes. The 
restrictions imposed by T invariance in the first case are equivalent to those imposed 
by C invariance in the second case. 

Let us also consider a reaction in which one particle disintegrates into two: 
a—>b+c. In the centre-of-mass system (the rest frame for particle a), we have 
pp = —p-. Scalar multiplication of the equation j, = j,+j. by pp gives 


Na = Ay — Ac (69.12) 


(the helicity A, of particle a is defined as the component of its spin in the direction of 
the momentum of one of the secondary particles). This relation can be regarded as a 
consequence of the additional symmetry present in the process considered, namely 
the axial symmetry about the directions of p, and p.. If the spin s, of particle a is less 
than s, + s,, the relation (69.12) reduces the number of possible sets of values of Aq, Ap, 
\, and therefore the number of independent helicity amplitudes of the disintegration. 
The total angular momentum J is then equal to the spin s, of the primary particle, and 
is consequently fixed. 
The P invariance in the disintegration is expressed by the relation 


(Avdc|S"[Aa) = ae Sper =hi SHA), (69.13) 


where we have used (69.4) and also the transformation (16.16) for the wave function of 
a single particle. 

If the primary particle is strictly neutral, further limitations arise if C parity is 
conserved. Three cases are to be distinguished here. If the disintegration products are 
also strictly neutral, we must have C, = C,C,; this condition either prohibits the 
disintegration altogether, or is satisfied and causes no further restriction. If the 
particles b and c are different, then C invariance implies a relation between the 
amplitudes of the different processes a>b+é and a>b+c. Finally, for the 
disintegration a > b + b, there is a restriction because, for a given charge parity C and 
a given total angular momentum J = s,, the system may be in states either symmetric 
or antisymmetric with respect to the helicities, depending on the parity of the number J 
and on the sign of C. 

CP invariance implies the equality of amplitudes for the disintegrations a>b+c 
and d—>b+é: 


(Apde|S?|da) = (AsAe|S? [Aa), (69.14) 


where Az = —A, and so on; i.e. it implies equal probabilities of disintegration for the 
particle and the antiparticle. If the particle can disintegrate in more than one way 
(through various channels), this equality applies to each channel. This conclusion, it 
must be emphasized, is based on the existence of CP invariance, which is not a 
universal property of Nature. Only CPT invariance is universal, and this by itself 
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would lead only to the equation 
(ApAc|S7|Aa) = (AalS”|ApAz), 


in which the right-hand side refers to the process inverse to disintegration. We shall 
see later ($71) that the condition of CPT invariance, together with unitarity 
requirements, does lead to a relation, although a more restricted one, between the 
disintegration probabilities for the particle and the antiparticle. 


PROBLEMS 


PROBLEM 1. Using (69.6), obtain a classification of the possible states of a two-photon system. 


SOLUTION. In this case A1, A2 = £1. Foreven J (>0), according to (69.6), three states symmetric in A1, 
d2 are allowed: 


(a) dou, (b) dm.-1.-1, (c) Wimi-1+ Wm-1,1- 
For odd J (>1), one antisymmetric state is allowed: 
(d) ami,—-1 — Wm,-1,1- 


States (c) and (d) also have a definite parity (+1): according to (69.4), 
P(Wim1,-1 + wbm-11)= + (—1)' (demi -1 ~ Wom,-1,1)5 


the factor +(—1)’ = 1, since the upper sign refers to even values of J and the lower sign to odd values. States 
(a) and (b) themselves have no definite parity, but even and odd states are obtained by taking the 
combinations 


(a') Wm + Wim,-1,-1, (b’) Wamii — Wum,-1,-1- 


When J = 0, only A; = Azis allowed by the condition |A; — A2| < J, so that state (c) does not occur, leaving one 
even and one odd state, (a’) and (b’). Finally, if J = 1, state (d), which is the only possible state for odd J, is 
forbidden because it has A = 2>J. Thus we arrive at the table (9.5) for the permissible states. 


PROBLEM 2. In the non-relativistic approximation, the total angular momentum J of the system is 
found by adding the spin S and the orbital angular momentum L. For a system of two particles, find the 
relation between the states |JLSM) and |JM)1A2). 


SOLUTION. According to the rule for constructing wave functions when adding angular momenta, 
we have 


WitsM = X{Ws,o,Ws0(0102|SMs)}rm,(MiMs|JM), (1) 


where i/o are the eigenfunctions of the spin s with component o along a fixed z-axis, Wm, those of the 
orbital angular momentum L with component M_; the expression in the braces corresponds to the 
addition of s; and s2 to give S, after which S is added to L to give J; the summation is over all m-type 
indices. Let all functions be expressed in the momentum representation, as functions of the direction n 
of the momentum p= pi, and let the functions ys. be expressed in terms of the functions w,, of the 
helicity states by means of QM, (58.7): 


Uso = > DEY.) vary, 


(s2) 
Ws. = D2, D-r.0,()Wn,-r>- 
202 2 A2 A Ww A2 
For the function wim,, we have 
wim, = Yim,(n) 


; 2L+1 
= re , ae D ir (I), 
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using QM (58.25) and the definition (16.5). Substituting these functions in equation (1), and twice using the 
expansion QM, (110.1), together with the orthogonality of the Clebsch-Gordan coefficients (QM, (106. 13)), 
we obtain the expansion 


WILSM = pa Wmn,a4JMA1A]JLSM ), (2) 
where 
23 +1 
WIMA, A> = War, Wn,—r.D Yir(n) J _ ; A=Ai1- A2, 
and the coefficients are 
MAXIA a 6 La SO SaES a $2 A S :) 
(JMX1A2|JLSM) = (-i)"(-1) VIQL+ D2S+ D] he . =o. (3) 


Since the transformation (2) is unitary, we have 


(JLSM|JMX1\2) = (JMA1A2\JLSM)*. 


§70. Invariant amplitudes 


In the helicity amplitudes, a particular frame of reference is used, namely the 
centre-of-mass system. But, in order to calculate the scattering amplitudes by means 
of invariant perturbation theory (and also to examine their general analytical 
properties), it is convenient to write them in an explicitly invariant form. 

If the particles concerned in the reaction have no spin, the scattering amplitude 
depends only on the invariant products of the 4-momenta of the particles. For a 
reaction of the form 


a+b->c+d, (70.1) 


these invariants may be taken as any two of the quantities s, t, u defined in $66. Then 
the scattering amplitude reduces to a single function M;; = f(s, t). 

If the particles have spins, then, besides the kinematic invariants s, t, u, there are 
also invariants which can be constructed from the wave amplitudes of the particles 
(bispinors, 4-tensors, etc.). The scattering amplitudes must then have the form 


My a S fn(s, t)F,,, (70.2) 


where the F, are invariants which depend linearly on the wave amplitudes of all the 
particles concerned (and also on their 4-momenta). The coefficients f,(s, t) are called 
invariant amplitudes. 

By choosing the wave amplitudes in such a way as to correspond to particles with 
definite helicities, we obtain definite values of the invariants F, = F,,(\j, Az). Then the 
helicity scattering amplitudes are linear homogeneous combinations of the invariant 
amplitudes f,. Hence we see that the number of independent functions f,(s, t) is equal 
to the number of independent helicity amplitudes. Since the latter number is easily 
determined, as shown in §69, this makes easier the construction of the invariants F,, 
their number being known in advance. 

Let us consider some examples, assuming in every case that the interaction is 
T-invariant and P -invariant. The latter property implies that the invariants F,, must be 
true scalars, not pseudoscalars. 
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SCATTERING OF A PARTICLE WITH SPIN 0 BY ONE WITH SPIN } 


To find the number of invariants (that is, the number of independent helicity 
amplitudes), we note that the total number of elements of the matrix S’ (i.e. of 
different sets A1, Ax, Aj, A4) is in this case four: A, = A, = 0, Az, AS = +3. When the P 
invariance 1s taken into account, the number of independent elements is reduced to 
two, and this is unchanged by the inclusion of T invariance. 

The two independent invariants may be taken as 


F, = w'u, F,= u'(yK)u, (70.3) 


where u = u(p), u’ = u(p') are the bispinor amplitudes of the initial and final fermions; 
K =k+k’', where k and k’ are the 4-momenta of the initial and final bosons.t 

The T invariance of the quantities (70.3) is evident if we note that under time 
reversal the products #’u and i'y“u are transformed according to the same rule (28.6) 
as the operators wih and wy“, whose matrix elements they are: ii'u is invariant, and 
the 4-vector u’yu is transformed according to 


u'y°’u > i'y°u, i’yu > — i'yu. 


The 4-momenta are transformed similarly: (K°,K)—>(K°,—K), and the scalar 
product F, = K,(u’y“u) is therefore invariant. 


1 
ELASTIC SCATTERING OF TWO IDENTICAL PARTICLES WITH SPIN 5) 


To find the number of independent helicity amplitudes, it is convenient to start 
from linear combinations of the helicity states: 


Wig = Wy Yay Wg = Wa = w__. 
Ure = Wi + ws, Yu = Wr — Ws, 


where the suffixes + denote the values +3 of the helicities of the two particles. The 
states 1g, 2g, 3g are even, and u is odd, with respect to interchange of the particles. 
The transitions g<>u are forbidden, so that there remain 16—6= 10 matrix ele- 
ments, when the interchange symmetry is taken into account. The functions WW, 
and w,, and ys, have opposite parities with respect to the inversion P; the 
prohibition of transitions between them reduces the number of independent am- 
plitudes to six. Lastly, the T invariance equalizes the amplitudes of the transitions 
lg>3g and 3g—1g, leaving only five independent amplitudes. The five in- 
dependent invariants may be taken as 


F, = (Uu;)(Uzu), F,= (iy us)(asy° Uy), 


FP; = (iy"u)(ay,.U2), Fi = (aity“yu)(y,y ud), (70.4) 
Fs o— (ajo""U))(i130,,,U2), 
+ At first sight, there might appear to be another invariant of the form i’o,,.k"k'’u (with the matrices oy, 


defined by (28.2)), but this is easily seen to reduce to the invariants (70.3) by means of the conservation law 
k'=p+k-—p’ and the equations (yp)u = mu, i'(yp') = mii’ satished by the bispinor amplitudes. 
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where u,, uy are the bispinor amplitudes of the initial particles and uj, u; those of 
the final particles. Interchange of the initial (or of the final) particles gives no new 
invariants: the invariants obtained can be expressed in terms of the previous ones 
(§28, Problem). But the expression (70.2), with the F, given by (70.4), does not 
explicitly take account of the requirement that interchange of two identical fer- 
mions must change the sign of the scattering amplitude. An expression which 
satisfies this condition may be written 


Myi = [Cui )(iau2) fi(t, uw) — (ui )(U Ua) fi(U, t)]+ +>. (70.5) 


When p; and p; (or p; and p>) are interchanged, the kinematic invariants s > s, t > u, 
u — t, so that the condition is necessarily satisfied. 


ELASTIC SCATTERING OF A PHOTON BY PARTICLES 
WITH SPIN 0 OR } 


The amplitude of this process is conveniently expressed by means of the 
space-like unit 4-vectors e", e® which satisfy the conditions 


et = gO 1 eDe2 — 9, 
(70.6) 
ek = eMk =0, eve Sek S00: 


for each of the two photons, these 4-vectors can be the unit 4-vectors by means of 
which an invariant description of their polarization properties is obtained (88). 

Let k and k’ be the initial and final 4-momenta of the photon; p and p’ those of 
the scattering particle. The 4-vectors 


+ / 
P*=p*+p"-—K* pap 


N’ =e” P_g.K,, 


> 


(70.7) 


where 
K=k+k’, q=p-—p'=k'-—k, 


are evidently orthogonal to one another and also to the 4-vectors K and q, and 
therefore to k and k’. Being orthogonal to the time-like 4-vector K (K* = 2kk’ > 0), 
they must themselves be space-like: in a frame of reference for which K = 90, it 
follows from KP =0 that P)=0 and hence P’?<0. Normalizing P and N by 
putting 


pices N* er — Pp’ (70.8) 
(—N?)’ V(—P’y 


we obtain a pair of 4-vectors which have all the required properties. It may be 
noted that e® is a true vector and e" a pseudovector. 
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The photon scattering amplitude may be written 
My = Fej*e,, (70.9) 


in terms of the polarization 4-vectors e and e’ of the initial and final photons. 

The photon helicity has only two values, +1. Hence, for the scattering of a 
photon by a particle with spin zero, the number of independent helicity amplitudes 
is the same as for the mutual scattering of particles with spin 0 and 3, namely two. 
The tensor F™ in (70.9) has to be constructed from the particle 4-momenta only. It 
can be written 


pw ae fete ae peer or. (70.10) 


where f; and f are invariant amplitudes. It should be noted that no term containing 
a product e*e®# can appear in F™, since this product is a pseudotensor and 
would give a pseudoscalar on substitution in (70.9). 

Lastly, let us consider the scattering of a photon by a particle with spin 3. To 
find the number of independent helicity amplitudes, we note that the total number 
of elements of the matrix S’ in this case is sixteen; the helicity of each of two 
initial and two final particles has two values. The condition of P invariance reduces 
this number to eight, and that of T invariance brings it down to six. 

Here, we write the tensor F,, in the form 


where Gp and G; are true scalars, G; and G» are pseudoscalars, and all four are 
bilinear in the bispinor fermion amplitudes &(p'’) and u(p), being of the form 


G, = U(p)Qnu (Pp). (70.12) 
The general form of the matrices (with respect to the bispinor indices) Q, is 


Qo= fit filyBK), Q,= y'(f3+ falyK)), 
Q=y(fs+flyK)), Q:=f7+ felyK), 


where K =k +k’. The coefficients f,,..., fg are invariant amplitudes, in this case 
eight in number (instead of the correct value of six), because the condition of T 
invariance has not yet been imposed. 

Time reversal interchanges the initial and final 4-momenta of the particles, and 
also changes the sign of their space components: 


(70.13) 


(ko, k)<>(ko,—k’), (Do, p)<> (Do, — p’)- (70.14) 
The photon polarization 4-vectors are transformed according to 


(€y, e)<>(e4*, — e'*) (70.15) 
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(cf. (8.1la)); hence 


(eo* eo, €}* eo, EF ex) > (e0* eo, — €0*e:, €1*e;). 


By virtue of the last transformation, the condition of invariance of the scattering 
amplitude (70.9) is equivalent to 


(Foo, Fio, Fix.) > (Foo, — Foi. Fi). 
On the other hand, the changes (70.14) imply 


(Ko, K) > (Ko, — K), (qo, 4) > (—4o, q), 
(Po, P)=> (Po, P), (No, N) > (No, —N), 


so that 

(e§”, a) > (ef? — a), (70.16) 
Hence, from (70.11), we must have 

Go, 1,3 Go, 1,3, G2, — G. 
Under time reversal, 
i'yu>—i'yu, — d'y(yK)u>ii'y(yK)u, 

aS is evident from the transformation laws for pseudoscalar and pseudovector 
bilinear forms (28.6). From (70.12), (70.13) it is now evident that, because of the T 


invariance of the scattering amplitude, 


fr=fe=9. (70.17) 


§71. The unitarity condition 


The scattering matrix must be unitary: SS* = 1, or in terms of matrix elements, 
(SS*) ji oo > SinStn = Ofi, (71.1) 


where the suffix n labels the possible intermediate states.t This is the most general 
property of the S-matrix, which ensures that the orthonormality of the states is 
preserved in the reaction; cf. QM, 88125 and 144. In particular, the diagonal 
elements of equation (71.1) simply express the fact that the sum of the transition 


+ The actual meaning of 6, in (71.1) depends, of course, on the specific choice of quantum numbers 
and on the normalization of the wave functions of the system. It must be defined so that =; 5, = 1. 
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probabilities from a given initial state to all final states is unity: 


» |Snil? aie 


Substituting in (71.1) the matrix elements in the form (64.2), we obtain 


Ty: — Tk; = i(27)* D6 (P; — Pr) Tin T in 


= i(2a)* ¥ 8 (P; — Pa)T% Tri. (71.2) 


The two equivalent forms on the right are obtained by writing the unitarity 
condition respectively as SS* = 1 and S*S = 1, with opposite orders of the factors § 
and S*. 

It should be noticed that the left-hand side of the equation is linear in the matrix 
elements of T, but the right-hand side is quadratic. If the interaction contains a 
small parameter (e.g. the electromagnetic interaction), the left-hand side is there- 
fore of the first order of smallness and the right-hand side is of the second order. 
The latter may consequently be neglected in a first approximation; then 


Ty = T¥;, (71.3) 


i.e. the matrix T is Hermitian. 

In order to make the unitary condition (71.2) more specific, we must understand 
precisely what is meant by the summation over n. Let us do this for a two-particle 
collision, assuming that the conservation laws allow only elastic scattering. Then all 
the intermediate states in (71.2) are likewise “two-particle” states. Summation over 
these signifies integration over the intermediate momenta p{, p3, and summation 
over the spin quantum numbers (for example, the helicities) of the two particles, 
which we denote by X”: 


V V'd’pid’p3 p 
2 = (27)? 2 


Eliminating the delta functions in the same way as in §64, we obtain the “‘two- 
particle” unitarity condition in the form 


eS T= oye Rl 7,.Theetes do", 


where p is the momentum and « the total energy in the centre-of-mass system. The 
normalization volume does not appear after changing from the amplitudes Tj; to Mg 
in accordance with (64.10): 


=s = l el f ” 
M; — M*; ny» M,,M*,,do”. (71.4) 
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Let the elastic scattering amplitude be defined so that 
do = |(n'd'|f|ma))? do’, (71.5) 


where n and n’ are the directions of the initial and final momenta, A and A’ the 
initial and final spin quantum numbers. Comparison with (64.19) shows that 


ry! as 
(n’A'|f|nA) = eo Mii, (71.6) 
and the unitarity condition (72.4) becomes 
(n'A'|f|mA) — (nA |f[n’A')* = ‘Pl Ss i (n’A'|f [nA "nA |f[n"A")* do", (71.7) 
% 


which generalizes the familiar formula of the non-relativistic theory, QM (125.8). 

The ‘“‘amplitude of zero-angle elastic scattering” is the diagonal matrix element 
Ti, in which the final states of the particles are the same as their initial states.t For 
this amplitude the unitarity condition (71.2) becomes 


2im Tj = (27)* > |Tin PSP; — Pr). (71.8) 


The right-hand side of this equation differs only by a factor from the total 
cross-section for all possible processes of scattering from the given initial state i. 
For let this cross-section be denoted by o;; then summation of the probability (64.5) 
over states f and division by the flux density j gives 


4 
ie ea S |Ta[?8 CP; — Px), 


whence 
(2V/j) im Tj = 0%. 


The normalization volume is eliminated by putting Tj, = M;/(2e, V -2e2V) (where & 
and e, are the energies of the particles in the centre-of-mass system) and substitut- 
ing j from (64.17): 


im Mi = 2|pleo:. (71.9) 


This formula expresses the optical theorem. If the elastic scattering amplitude 
(71.6) is used, the theorem takes the customary form 


im{na |f|nA) = |plo./4a; (71.10) 
cf. QM, (142.10). 


+t It must be stressed that the matrix elements of T are concerned, not those of S; that is, the 
diagonal element is taken after subtracting the unit matrix from S. 
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If the S-matrix is given in the angular-momentum representation (partial am- 
plitudes), it is diagonal with respect to J, and the unitarity condition can therefore 
be written separately for each value of J. 

For example, if only elastic scattering is possible, the unitarity condition is 


> (ASTIN MALSZIA)* = By (71.11) 


Because of the T invariance, the elastic scattering matrix is symmetric (cf. (69.10)), 
and hence can be reduced to diagonal form. The unitarity condition then requires 
that the diagonal elements should be of unit modulus, and they are customarily 
written in the form 


Si = exp(2id;,), (71.12) 


where the 6,, are real constants, depending on the energy (the suffix n labelling the 
diagonal elements for a given J). In the general case, when the number N of 
independent amplitudes exceeds the order of the (square) matrix S’, the coefficients 
of the transformation which diagonalizes S’ depend on J and E (these coefficients 
then comprise not only the principal values of the matrix but also independent 
quantities equivalent to the original N quantities). If, however, the number N is 
equal to the order of the matrix S’ (and therefore to the number of its principal 
values), the diagonalization coefficients are universal constants, and the diagonaliz- 
ing states have definite parities (but not, of course, definite helicities). 
The condition (71.11), expressed in terms of the partial amplitudes (A'|f’|A), is 


(NIE TAD — AL fE]AY* = 2ilp| 2 (NLP A ALAA, (71.13) 


as is easily seen by substituting the expansion (68.13) in (71.7) and using the 
orthonormality of the D functions. If there is T invariance, the matrix (A'|f’|A) is 
symmetric, and (71.13) becomes 


im (X']f"|A) = [pl a'lff A). (71.14) 


If the matrix is diagonalized, the diagonal elements are 
fi= ze (e75, — 1) = Le, sin 5;,. (71.15) 
2i|p| Ip| 


Finally, we may mention some consequences which follow from the unitary 
condition together with the requirement of CPT invariance. The latter shows that 


Ty = Ty, (71.16) 


where i and f are states which differ from i and f in that all the particles are 
replaced by antiparticles (and helicities are reversed, and also angular momentum 
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components if spherical waves are used). In particular, for the diagonal elements, 
Ty = Ti. 


It therefore follows from (71.8) and (71.9) that the total cross-section for all 
possible processes (with a given initial state) is the same for reactions of particles 
and of antiparticles. 

In particular, the total disintegration probabilities (i.e. the lifetimes) of the 
particle and the antiparticle are equal. These results, together with the equality of 
particle and anti-particle masses ($11), are most important consequences of the 
CPT invariance of the interactions. A similar statement for each possible disin- 
tegration channel separately would require CP invariance also (see the end of §69). 


PROBLEM 


From the unitarity condition, find the relation between the phases of the partial amplitudes for 
photoproduction of pions from nucleons (y+ N—7a7+N) and elastic scattering of pions by nucleons 
(7 +N--—>7+N), using the fact that aN scattering depends on strong interactions but photoproduction 
and yN scattering depend on an electromagnetic interaction. 


SOLUTION. Let the partial amplitudes be denoted by 
(aN|S|yN) = Say, (yN|S|yN) = Sy, (aN|S|aN) = San 

the suffix J and the helicity suffixes being omitted. Photoproduction is a first-order process with respect 
to the charge e, and yN scattering a second-order process; hence Sry ~ e, Sy — 1~ e*. The amplitude S,. 
is not small. The conditions (71.1) give, as far as terms in e, 

Soy + Siaooa ae Se + SiOvd o 0, (1) 
on the right-hand side of (2), 1 denotes a unit matrix in the spin variables. Because of T invariance the 
matrix S,, is symmetric, and S,, = S,,. Let us take the matrix S,, in diagonal form, i.e. with respect to 


pion states having definite parities; then it follows from (2) that the diagonal elements have the form e7 
with various constants 6,. Then (1) gives for each element of the matrix S,, 


2i8, 
Say Sky = sale <4 3 
whence 
- id 
Say = +|S,,lie°. 


Thus the phase of the partial amplitude for photoproduction (in a state having a definite parity) is 
determined by the phase of elastic 7N scattering. 


CHAPTER VIII 


INVARIANT PERTURBATION THEORY 


§72. The chronological product 


THE probabilities of various processes in collisions between particles whose 
interaction may be regarded as small are calculated by means of perturbation 
theory. In its ordinary form (in non-relativistic quantum mechanics), however, the 
formalism of this theory has the defect of not exhibiting explicitly the conditions of 
relativistic invariance. Although, when this formalism is applied to relativistic 
problems, the final result will satisfy these conditions, the calculations are con- 
siderably complicated by the non-invariant form of the intermediate expressions. 
The present chapter will deal with the development of a consistent relativistic 
perturbation theory free from this defect, first established by R. P. Feynman 
(1948-1949). 

With a view to a second-quantization description of the system, let ® denote its 
wave function in the “space” of occupation numbers for the various states of free 
particles. The Hamiltonian of the system is H = H)+ V, where V is the interaction 
operator. Let ®, be the eigenfunctions of the unperturbed Hamiltonian, each 
corresponding to certain definite values of all the occupation numbers. Any 
function ® can be expanded as © = C,@®,. Then the exact wave equation 


ia@/at = (Hy) + V)® (72.1) 


becomes a set of equations for the coefficients C,: 
IC2=D2V ge MG, (72.2) 


where Vnmn are the time-independent matrix elements of the operator V, and E, the 
energy levels of the unperturbed system (cf. QM, §40). 

By definition, the operator V does not depend explicitly on the time. The 
quantities 


Vam(t) = Vom ei En Em)t (72.3) 


on the other hand, may be regarded as matrix elements of the time-dependent 
operator 


V(t) = effet Ve Hot, (72.4) 
This 1s said to be an operator in the interaction representation, as opposed to the 


283 


284 Invariant Perturbation Theory 872 


original time-independent Schrédinger operator V.t Now denoting the wave func- 
tion in this new representation by the same letter ®, we can write equations (72.2) 
symbolically as 


id = V(t)®. (72.5) 


The change in the wave function in this representation is due entirely to the action 
of the perturbation, i.e. it corresponds to processes which result from the inter- 
action of the particles. 
If O(t) and P(t + St) are the values of ® at two successive instants, (72.5) shows 
that 
P(t + St) = [1 — iSt - V(t)]@(t) 
= grit : YO@(t), 


Accordingly the value of ® at any instant t; can be expressed in terms of its value 
at some initial instant t; (<t;) by 


(t;) = (11 eH )O(t), (72.6) 


where the product [|] is the limit of the product over all the infinitesimal intervals 


dt, between t; and t,. If V(t) were an ordinary function, this limit would reduce 
simply to 


exp(—i f V(t) dt), 


but this result depends on the commutativity of the factors pertaining to different 
instants, which is assumed in changing from the product in (72.6) to the summation 
in the exponent. For the operator V(t) there is no such commutativity, and the 
reduction to an ordinary integral is not possible. 

We can write (72.6) in the symbolic form 


O(t;) =T exp|—i f V(t) at l(t), (72.7) 


where T is the chronological operator, implying a certain ‘‘chronological’’ sequence 


+ It must be emphasized that the definition (72.4) makes use of the unperturbed Hamiltonian Hp. In 
this it differs from the Heisenberg representation of operators, where 


V(t) A eit wy e itt. 


see QM, §13. 
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of time instants in the successive factors of the product (72.6). In particular, putting 
t; > —~, t; >+, we have 


D(+~) = $@(—~), (72.8) 
where 
§=T exp|—i i V(t) at}. (72.9) 


The significance of writing the formally exact solution of the wave equation in 
the form (72.7)-(72.9) is that it easily leads to the series in powers of the 
perturbation 


g= 5 CD | dt, | deo. i dt, -T{V(t,) V(t)... Vt}. (72.10) 


— 0 


Here, in each term, the kth power of the integral is written as a k-fold integral, and 
the symbol T signifies that in each range of values of the variables f), t2,..., t, the 
corresponding operators must be put in chronological order, with the value of t 
increasing from right to left.t 

It is evident from the definition (72.8) that, if the system was in a state ®; (an 
assembly of free particles) before the collision, the probability amplitude for a 
transition to a state ®; (another assembly of free particles) is the matrix element Sj. 
Thus these form the S-matrix. 

The electromagnetic interaction operator has already been given in 843: 


V=e [ GA) d°x. (72.11) 
Substitution of this in (72.9) gives 
S=T exp —ie | (jA) d*x } (72.12) 


It is important to note that the operator (72.12) is relativistically invariant. This is 
seen from the facts that the integrand is a scalar, the integration over d‘x is 
invariant, and the time-ordering operation is invariant. The last point, however, 
needs further explanation. 

The order of two time instants ft, and fp, i.e. the sign of t2—t), is independent of 
the frame of reference chosen if these instants relate to world points x; and x, 
separated by a time-like interval: (x.—.x,)’>0. In such a case the invariance of 


+ The derivation of the rules of relativistic perturbation theory by means of the expansion (72.10) ts 
due to F. J. Dyson (1949). 


QE4-T 
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time-ordering necessarily follows. But if (x.—x,)’<0 (a space-like interval), we 
may have both t,>t, and t,<t, in different frames of reference.t Now two such 
points correspond to events between which there can be no causal connection. It is 
therefore evident that the operators of two physical quantities relating to such 
points must commute, since the non-commutativity of operators signifies, physic- 
ally, that the corresponding quantities cannot be measured simultaneously, and this 
presupposes a physical connection between the two measurements. Thus the 
time-ordering of the product remains invariant in this case also: though a Lorentz 
transformation may reverse the sequence of time instants, the factors commute and 
can therefore be restored to their chronological order.# 

It is easy to see that the definition of the S-matrix given in this section 
necessarily satisfies the unitarity condition. Writing S as the chronological product 
in (72.6) and using the fact that V is Hermitian, we find that S* is given by the 
product of similar factors, exp[idt, - V(t,)] (with the opposite sign of the exponent), 
in the reverse of the chronological order. Thus all the factors cancel in pairs when 
S is multiplied by S*. 

It should be noted that the unitarity of the operator S$ is ensured in this case 
because the Hamiltonian is Hermitian. The unitarity condition is actually more 
general than the assumptions on which the theory given here is based. It must be 
satisfied even in a quantum-mechanical description which makes no use of the 
concepts of the Hamiltonian and the wave functions. 


§73. Feynman diagrams for electron scattering 


We shall show by means of specific examples how the scattering matrix 
elements are calculated. These examples will facilitate the subsequent formulation 
of the general rules of invariant perturbation theory. 

The current operator j contains the product of two electron w-operators. Hence 
processes might occur in the first order of perturbation theory which involve (in the 
initial and final states) only three particles: two electrons (the operator j) and one 
photon (the operator A). It is easily seen, however, that such processes cannot 
occur between free particles, being forbidden by the laws of conservation of 
energy and momentum. If p; and p2 are the 4-momenta of the electrons, and k that 
of the photon, the conservation of 4-momentum would be represented by k = 
P2— Pp; or k = p2+ p;. But such equations are impossible, since for a photon k* = 0, 
whereas the square (p»+ p,)’ is certainly not zero: if we calculate this invariant in 


+ Instead of using the terms ‘‘time-like’ and “‘space-like”’, we often refer briefly to regions 
respectively inside and outside the light cone: all points x separated from a point x’ by an interval such 
that (x — x’ >0 lie within a double cone having its vertex at x’; points for which (x — x' <0 lie outside 
this cone. 

+ This statement needs refinement to avoid misunderstanding in its application to the product 
V(ti)V(tr).. . Since the operator V itself is not gauge-invariant (it varies with A), the factors V(t), 
V(t), . _ though commuting in one gauge of the potential, may be non-commutative in some other 
gauge. The statements made above must therefore be formulated as asserting the possibility of choosing 
a gauge for the potential in which V(t)) and V(t2) commute outside the light cone. This reservation 
clearly has no effect on the invariance of the S-matrix: the scattering amplitudes, which are actual 
physical quantities, cannot depend on the gauge of the potential, a result which formally follows from 
the gauge invariance of the action integral ($43). 
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the rest frame of one of the electrons, we have 


(p2 + pi) = 2(m? + pipr) 
= 2(m?+ E\f2F Pi ° P2) 


=2m(m + £2), 
and, since e, >m, it follows that 


(po+pi)>0, (pr-pi) <0. (73.1) 


Thus the first non-vanishing (non-diagonal) elements of the S-matrix can appear 
only in the second order of perturbation theory. All the relevant processes are 
comprised in the second-order operator obtained by expanding the expression 
CPAPA 


$= -£ ff ateate TEA, COMEAM) 


Since the electron and photon operators commute, the T product can be resolved 
into two: 


2 
§% = or | i d*x d*x'» TG#(x)}’(x)T(A, (x) A,(x')). (73.2) 


As a first example, let us consider elastic scattering of two electrons. In the 
initial state there are two electrons with 4-momenta p, and p:, in the final state two 
electrons with other 4-momenta p; and p,. It is also assumed that all the electrons 
are in definite spin states; the spin variable indices will be everywhere omitted, for 
brevity. 

Since there are no photons in either state, the required matrix element of the T 
product of the photon operators is the diagonal element (0|.. . |0), where |0) denotes 
the photon vacuum state. This value of the T product averaged over the vacuum is 
(for each pair of indices p, v) a definite function of the coordinates of the two 
points x and x’. Since 4-space is homogeneous, the coordinates can appear only as 
the difference x — x’. The tensor 


D,,,(x — x') = i(0|/TA, (x) A,(x)]0) (73.3) 
is called the photon propagation function or photon propagator. It will be cal- 
culated in §76. 

For the T product of the electron operators, we have to calculate the matrix 
element 


(34|Tj*(x)j"(x)|12), (73.4) 


where the symbols |12), |34) denote states in which pairs of electrons have the 
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corresponding momenta. This element also can be represented as a vacuum 
expectation value, by using the obvious relation 


(2|F|1) = (O|a2Fa7|0), 


where F is any operator, dj the creation operator for the first electron and d, the 
annihilation operator for the second electron. Hence, instead of (73.4), we can 
calculate the quantity 


(OlasagT(j"(x)j"(x'))aza7)0), (73.5) 


the indices 1, 2,... being abbreviations for pi, p2,.... 
Each of the ne current operators is a product, j = wy, and each of the 
w-operators 1S a sum: 


b= > (Gp, + by y-p), =>) (apd + Bypb-p); (73.6) 


P 


the second term in each expression contains the positron operators, which in the 
present case “do not act”. Hence the product j“(x)j’(x’) is a sum of terms, each 
containing the product of two operators d, and two 4G,. These operators must 
annihilate electrons 1 and 2, and create electrons 3 and 4. They must therefore be 
the operators 4}, d >, G3, G4, which are said to contract with the “external” 
operators Gd], G3, G3, G4 in (73.5) and cancel according to the equations 


(O|apa>;|0) = 1. (73.7) 
Four terms result, according to the w-operators from which d, da, G3, @4 in 
(73.5) are taken: 
ee eee es, Ore pee, ee, nag atl 
(73.5) = azad(by"b)(p'y"b')azai + asadpy Why’ )azai + 
eres nee eee | LS ed Leese 
SS). ed a 


+ aadby' Wb y'b')azat + aadpy"pb'y’b')azat, (73.8) 
| Cia || i ene ar eee | J 


where Ww = (Xx), W' = w(x’), and the can join operators which contract, Le. 
those from which a pair of operators d, a” is taken for the cancellation according to 
(73.7). In each term we can bring the oie operators together in pairs (4,471, 
etc.) by successive interchanges of d,, d2,..., and the mean value of their product 
is then equal to the product of the mean values (73.7). Since all these operators 
anticommute (1, 2, 3, 4 being different states),f we find that the matrix element 
(73.4) is 


(34|Tj*(x)7"(x’)[12) = Chay" bry') + bsy"di (bey) — 
— (Yay"dy) (bay) — bey" bb yy "W). (73.9) 


+ Because of this anticommutativity, the operators j(x) and j(x’) may here be considered to 
commute (in the calculation of the matrix element), and the T product symbol may therefore be omitted. 
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The sign of the entire sum is arbitrary, and depends on the order of the ‘“‘external’’ 
electron operators in (73.5). This is in accordance with the fact that the sign of the 
matrix element for scattering of identical fermions is itself arbitrary. The relative 
sign of the various terms in (73.9), of course, does not depend on the order of the 
external operators. 

The two terms in each line of (73.9) differ only by a simultaneous interchange of 
the indices pw, v and the arguments x, x’. This interchange clearly does not affect 
the matrix element (73.3), in which the order of factors is still established by the 
symbol T. Hence when (73.3) is multiplied by (73.9) and integrated over d‘x d‘x’, 
the four terms in (73.9) give two pairs of equal results, and the matrix element is 
therefore 


Sp = ie? | | d*x d*x! Dix — x’ H(dey"WDsy'W) — Day" By w Ok: 
(73.10) 


the factor 3 has now disappeared. 
The electron wave functions are the plane waves (64.8). The expression in the 
braces 1s therefore 
i } = (iisy"Ur)(isy"U1) e '(P2—Pa)x—i(P—P3)x" _ 
— (ilary"*U1)(tisy"Un) @1P PO KP2 Pde 
= {(iigy"U2)(iizy"U,) eo illp2-Pa)+(p3~PNgl2 _ 
= (iigy"U)(isy"U2) e Mi pad (93 P2182 e lPrtP2-P3-Pa)X | 
where X =3(x +x’), €=x-—x'. The integration over d*x d‘*x' is replaced by one 
over d*é d*X. The integral over d*X gives a delta function, so that p,+ po = p3+ ps. 


Then, changing from the matrix S to the matrix M (864), we have finally for the 
scattering amplitude 


Myi = e*{(igy“U2)D,,, (Pa — D2) (iis "Uy) — (tay “Uy)D,.,. (pa pd (isy"ur)}. (73.11) 


Here we have used the photon propagation function in the momentum represen- 
tation: 


D,,(k) = | D,,(€) e™ dé. (73.12) 


Each of the two terms in the amplitude (73.11) can be symbolically represented 
by means of a Feynman diagram: the first term by 


e?(dyyu,) D,, (k) (Ugy"4,) == (hac) 3) 


P, P, 
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Each point of intersection of lines (a vertex of the diagram) has a corresponding 
factor y. The “incoming” continuous lines towards a vertex represent the initial 
electrons, which are associated with the factors u, the bispinor amplitudes of the 
corresponding electron states. The “‘outgoing”’ continuous lines leaving a vertex are 
the final electrons, and correspond to the factors #. When the diagram is “‘read’’, 
these factors are written from left to right in the order of movement along the 
continuous lines against the direction of the arrows. The two vertices are joined by 
a broken line which represents a virtual (intermediate) photon ‘‘emitted’’ at one 
vertex and “absorbed” at the other, and corresponds to the factor —iD,,(k). The 
4-momentum of the virtual photon k is determined by the “conservation of 
4-momentum” at the vertex: the total momenta of the incoming and outgoing lines 
are equal. In this case k = pj — p3= Pa— pr. AS well as the factors mentioned, the 
whole diagram is also assigned a factor (—ie)* (the exponent being the number of 
vertices in the diagram), and then represents a term in iM;;. Similarly, the second 
term in (73.11) is represented by the diagram 


e7(G4y7Uy) Ou,(k’) (Gg y7Uy) == yk (73.14) 


P, P. 


wv 


with k' = p, — p4= p3— p2. It does not matter whether the diagram is read from the 
end of p; or from that of py. The resulting expressions are equal, because the tensor 
D,, is symmetrical. The choice of direction for the virtual photon line is also 
immaterial: a change in its direction simply reverses the sign of k, which does not 
matter, since the functions D,,(k) are even (see §76). 

The lines corresponding to the initial and final particles are called the external 
lines or free ends of the diagram. The diagrams (73.13) and (73.14) differ by the 
interchange of two electron free ends (p; and p,). This interchange of two fermions 
reverses the sign of the diagram, in accordance with the fact that the two terms 
appear with opposite signs in the amplitude (73.11). 

We shall everywhere use Feynman diagrams in this momentum representation, 
but they can also be associated with the terms in the scattering amplitude in the 
original coordinate representation (the integrals (73.10)). Here the electron am- 
plitudes are replaced by the corresponding coordinate wave functions, and the 
propagators are in the coordinate representation. Each vertex corresponds to one 
of the variables of integration (x or x’ in (73.10)); the factors assigned to the lines 
that meet at a vertex are taken as functions of the corresponding variable. 

Let us now consider the mutual scattering of an electron and a positron; their 
initial momenta will be denoted by p_ and p,. respectively, and their final momenta 
by p“ and pi. 

The positron creation and annihilation operators appear in the w-operators 
(73.6) together with the electron annihilation and creation operators respectively. 
Whereas in the previous case the operator ip annihilated the two initial particles 
and w& created the two final particles, here these operators act oppositely with 
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regard to electrons and positrons. The conjugate function #(—p,) will therefore 
now describe the initial positron, and w(—p‘) the final positron, both being 
functions of the 4-momentum with reversed sign. Taking account of this difference, 
we obtain the scattering amplitudet 


Myi = —e*(ii(p)y“u(p_))D,.,(p- — p)(i(— p+) y"u(— p4,)) + 
+ e*(i(—ps+)y"u(p_))D,.,(p- + p+)(u(p_)y"u(— p4)). (73.15) 


The two terms in this expression are represented by the following diagrams: 


ig oll? : - a ae ” 


y p-P! { p.+p, (73.16) 


at Tings ee -pt 


The rules for constructing the diagrams are altered only as regards the positrons. 
The incoming and outgoing continuous lines are again associated with factors u and 
u respectively. Now, however, the incoming lines correspond to final positrons and 
the outgoing lines to initial positrons, the momenta of all the positrons being taken 
with reversed sign. 

The difference between the two diagrams (73.16) should be noted. In the first 
diagram, lines of the initial and final electrons meet at one vertex, and those of the 
two positrons at the other. In the second diagram, initial electron and positron lines 
meet at one vertex, and final lines at the other. The upper vertex represents 
annihilation of a pair with emission of a virtual photon; the lower vertex represents 
the creation of a pair from this photon. 

This difference affects the properties of the virtual photons in the two diagrams. 
In the first diagram (“‘scattering” type), the 4-momentum of the virtual photon 1s 
the difference between those of the two electrons (or positrons); hence k? <0 (cf. 
(73.1)). In the second diagram (“‘annihilation” type), k’= p+ p+, and hence k” > 0. 
Here it should be noted that for a virtual photon we always have k* #0, unlike a 
real photon, for which k’ = 0. 

If the colliding particles are not identical and also not a particle and its 
antiparticle (for instance, an electron and a muon), then the scattering amplitude is 
represented by a single diagram: 


ple) pe) 


(73.17) 


1k 
die (4) 


p 


+ The sign of the whole amplitude is definite in the scattering of non-identical particles, being 
determined by the fact that in (73.5) the ‘“‘external’”’ operators must be arranged so that the two electron 
operators are both at the ends: (O|a'b’...b‘a"|0) (or both in the middle); this condition ensures the 
“same sign” of the initial and final vacuum states. The sign of the amplitude can also be verified from 
the non-relativistic limit: we shall see later ($81) that, in this limit, the second term in (73.15) tends to 
zero, and the first term tends to the Born amplitude of Rutherford scattering. 
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There can be no annihilation or exchange type diagram in this case. The same 
result can be obtained analytically by writing the current operator as the sum of 
electron and muon currents: 


A 
® 


j= j? + j 
ase (fy) 4 (Ys yp) 


and taking, in the product j“(x)j(x'), the matrix elements of terms which give the 
required annihilations and creations of particles. 

Let us now consider first-order processes, which, as mentioned at the beginning 
of this section, are forbidden by the conservation of 4-momentum. The matrix 
elements of the operator 


GW i | ioa@ us (73.18) 


for such transitions correspond to the creation or annihilation of three real particles 
(two electrons and one photon) at “the same point x”. They occur by the 
contraction of the operators h(x) and W(x) at the same point, and are expressed, 
for example in the case of photon emission, by integrals of the form 


Si = ~ie | PxcoW()yA*O) d's, 


which vanish because the integrand includes the factor exp[—i(p; — p2—k)x] witha 
non-zero exponent. In the language of Feynman diagrams, this means that diagrams 
with three free ends such as 


a. (73.19) 
are Zero. 


For the same reason, second-order processes involving six particles in the initial 
state (or in the final state) are impossible. In the matrix element S,; for such a 
transition, the integral over d*x d*x' would separate into a product of two vanishing 
integrals over d*x and d*x' of products of three wave functions taken at the same 
point. In other words, the corresponding diagram would separate into two in- 
dependent diagrams of the type (73.19). 


§74. Feynman diagrams for photon scattering 


Let us now consider another second-order effect: the scattering of a photon by 
an electron (the Compton effect). In the initial state let the photon and the electron 
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have 4-momenta k, and p,, and in the final state k, and p» (and also definite 
polarizations, which will be omitted for brevity). 
The photon matrix element ts 


QITA,(x)A(x))|1) = OleoTA, (x) A,(x’ct]0), (74.1) 


where 
A=), (Ay + Cf A¥). 
k 
Contraction of the external and internal operators gives 
ee a 
(74.1) = c,A,Aici + c2A,A,cr 
= AX, Ai, + AypAX, (74.2) 


where we have used the commutativity of the operators ¢; and ¢3; for the same 
reason, the symbol T can here be omitted. 
The electron matrix element ts 


(2\Tj*(x)j°(x 1) = OlaaTaby"h)(b'y"b')a7|0). (74.3) 


This involves four w-operators. Only two are concerned with the annihilation of 
electron | and the creation of electron 2, and will be contracted with the operators 
G{ and @. These may be w’, & or W’, w (but not &, w or Ww’, W': the creation and 
annihilation of two real electrons and one real photon at the same point x or x’ 
would give an expression equal to zero). By carrying out the two possible ways of 
contraction, we obtain two terms in the matrix element (74.3). These will first be 
written on the assumption that t > t’: 


es os . ee ae by 
(74.3) = aly" W\h'yW ai + ay iby Wai, (74.4) 


In the first term the contracted operators are dow > G43, bat > daiut. 
Since the operators G24; and d,d; are diagonal and appear at the end of the 
products, they can be replaced by the vacuum expectation value, i.e. unity. To 
make a similar transformation in the second term of (74.4), the operator Gd; must 
first be “‘pulled’’ to the left, and da, to the right. This is done by means of the 
commutation rules for the operators G,, 4): 


ip, Ww}, = it, wh. =0, 
idp, WH. = {dp, U} 74.5) 


{dp, uy, = Wy, {a;, wh, a Wp. 
Then (74.4) becomes 


Olay baby) —(by"wyhy'h))0), tt >t (74.6) 
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only the operator factors are averaged, of course. Similarly, for t < t', we obtain an 
expression differing by the interchange of p and v and of the primed and unprimed 
symbols: 


(O]—(h' yb) oy"h) + (bsy'bby"h))0), t<t’. (74.7) 


The two expressions (74.6) and (74.7) can be written as one by using the 
chronological product of the w-operators: 


Thx)n(x)= (x(x), th <t: 


A : (74.8) 

=—UW(x)b(x), t' >t, 
where i and k are bispinor indices. Then the first and second terms in (74.6), (74.7) 
can be combined in the form 


doy'XO|T ys + b'0)y'h + hy" (O|T eh’ - WlO)y"h, (74.9) 


where us - W denotes the matrix Wah. 

It should be noted that, in the natural definition (74.8), the operator products are 
taken with opposite signs for t<t' and t >t’. In this respect it differs from the 
definition of the T product which has been used for the operators A and j. This 
difference arises because the fermion operators & and w anticommute outside the 
light cone, unlike the commuting boson operators A and the bilinear operators 
j = wyd.t This procedure ensures the relativistic invariance of the definition (74.8). 
A formal proof of the commutation rules for the w-operators will be given in §75.4 

We shall define the electron propagation function or electron propagator, a 
bispinor of rank two, as 


G(x — x') = —i€0| Tui (x) i (x') 10). (74.10) 
Then the electron matrix element becomes 
(2|Tj* (x) XDD = try"Gy'h + ibby’Gy"Wn. (74.11) 


On multiplication by the photon matrix element (74.1) and integration over d*x d*x', 
the two terms in (74.11) give the same result, and so we have 


Sj = — ie? | i d*x d*x' Wo(x)y"G(x — x')y’hy(x') X 


x {AR (x)Ai(x') + A(X) Arp (X)}- (74.12) 


+ The w-operators themselves, it will be recalled, correspond to no measurable physical quantities, 
and therefore need not commute outside the light cone. 

+ The T product of any number of y-operators may be defined similarly. It is equal to the product of 
all the operators arranged in order of increasing time from right to left, the sign being determined by the 
parity of the interchange needed to obtain this order from the order shown under the T product symbol. 
Accordingly, this sign changes when any two w-operators are interchanged; for example, 


Tix) u(x!) = — Td (x’i(x). 
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Substituting the plane waves (64.8), (64.9) for the electron and photon wave 
functions and separating the delta function as in (73.10), we obtain finally the 
scattering amplitude 


My = —4rre*itx{(ye$)G(p, + ki(yei) + (ye:)G(pi — k2)(yes)}un, (74.13) 
where @, é2 are the photon polarization 4-vectors and G(p) the electron propagator 
in the momentum representation. 


The two terms in this expression are represented by the following Feynman 
diagrams: 


(74.14) 


Are®Uslye;) G (fF) (ved) u, 


The broken-line free ends of the diagrams correspond to real photons; the 
incoming lines (initial photon) are associated with a factor V(4m)e, and the 
outgoing lines (final photon) with a factor V(47)e*, where e is the polarization 
4-vector. In the first diagram, the initial photon 1s absorbed together with the initial 
electron, and the final photon is emitted together with the final electron. In the 
second diagram, the final photon is emitted together with the annihilation of the 
initial electron, and the initial photon is absorbed together with the creation of the 
final electron. 

The continuous internal line joining the two vertices represents a virtual 
electron whose 4-momentum is determined by the conservation of the 4-momentum 
at the vertices. This line is associated with a factor iG(f). Unlike the 4-momentum 
of a real particle, that of the virtual electron has a square which is not equal to m’. 
If the invariant f* is considered, for example, in the rest frame of the electron, we 
easily find that 


f'=(pitky>m’, f? =(pi— k2)? <M’. (74.15) 


$75. The electron propagator 


The propagation functions or propagators defined in 8873 and 74 are of 
fundamental importance in the formalism of quantum electrodynamics. The photon 
propagator D,, is a basic characteristic of the interaction of two electrons, as is 
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shown by its position in the electron scattering amplitude, in which it is multiplied 
by the transition currents of the two particles. The electron propagator plays a 
similar part in the electron—photon interaction. 

Let us now calculate the actual values of the propagators, taking first the 
electron propagator. Let the operator yp — m, where p, = id,, act on the function 


Gu(x — x') = —iO|T Yi) e(x')I0), (75.1) 


i and k being bispinor indices. Since h(x) satisfies Dirac’s equation (yp — m)b(x) = 0, 
we find that the result is zero at all points x, except those for which t = t'. The reason is 
that G(x — x’) tends to different limits as t>t’+0 and t-t'—0: according to the 
definition (74.8) these limits are respectively 


—i(Olbi(r, thy (x, t)|0) and +i(Oly(r’, t)yi(r, 1/0), 


and, as we shall see, they are not the same on the light cone. This causes an additional 
delta-function term to appear in the derivative dG/dt: 
a or ee cL Ae Fer 


Ot 0) + &(t — t')(Grr-+0 — Gror-o). (75.2) 


Since the derivative with respect to t appears in the operator yp —m in the form 
iy® d/dt, we therefore have 


(yP — m)xGu(x — x') = 5(t = t')yidOl{da(r, t), b(t’, t)},10). (75.3) 


The antigommutator Is calculated as follows. On multiplying the operators 
w(r,t) and wr’, t) (see (73.6)) and using the rules for interchange of the fermion 
operators d,, bp, we find 


{ui(r, t), WC’, t)}. = 2 [hpi Cr) Pipx (r’) + Wp, i(r) ys p,k(r') I], (75.4) 


where wW.)(r) are wave functions without the time factor; as in $873 and 74, the 
polarization indices are omitted for brevity. The set of all functions w.,(r), which 
are eigenfunctions of the electron Hamiltonian, forms a complete set of normalized 
functions, and according to the general properties of these (cf. QM, (5.12)) we have 


py [dpi (ro (r’) + wp (rb, .(r’)] = 8.6(r — r’). (75.5) 


The sum on the right-hand side of (75.4) differs from that in (75.5) in that wW is 
replaced by (*y°),, and its value is y.6(r—r’). Thus 


{h(r, t), U(r’, ty}. = d(r—r')yk. (75.6) 


From this formula it follows, in particular, that the operators Ww and w anti- 
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commute outside the light cone, as stated in §74. When (x — x’)? <0 there is always 
a frame of reference in which t = t’; if then r#r’, the anticommutator (75.6) is in 
fact zero. 

Substituting (75.6) in (75.3) (and omitting the bispinor indices), we have finallyt 


(yp — m)G(x — x’) = 6 (x — x’). (75.7) 


Thus the electron propagator satisfies Dirac’s equation with a delta function on 
the right-hand side. Mathematically speaking, therefore, it is the Green’s function 
for Dirac’s equation. 

We shall later be concerned not with the function G(€) itself (€ = x — x’), but with 
its Fourier components: 


G(p) = i G(é) e” ate (75.8) 


(the propagator in the momentum representation). Taking the Fourier component 
of each side of (75.7), we find that G(p) satisfies the algebraic equations 


(yp ~ m)G(p)= 1, (75.9) 
the solution of which is 
G(p) = =™ (75.10) 


The four components of the 4-vector p in G(p) are independent variables, not 
related by p°=po—p =m’. Writing the denominator in (75.10) as pj— (p’ + m’), we 
see that G(p), as a function of po for given p’, has two poles at po = +e, where 
e = V(p’+ m’). Thus, in the integration with respect to po in the integral 


G(é) ony i e-”€ G(p) d'p 
ony [ ap -e* | dpe” Gin) (75.11) 


(where t = t — t’), the question of avoiding the poles arises; until this is decided, the 
expression (75.10) remains essentially indeterminate. 

To settle this question, we go back to the original definition (75.1), and 
substitute in it the w-operators as the sums (73.6), noting that the only non-zero 
vacuum expectation values are those of the following products of creation and 
annihilation operators: 


(Olapaz|0)=1, — (0|bbz]0) = 1. 


+ The explicit form, including the bispinor indices, is 


(yp — m)uGu(x — x’) = 8 (x — x')Sx. (75.7a) 
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(Since in the vacuum state there are no particles, a particle has to be “created” by 
the operator d; or b} before it can be “annihilated” by 4, or b,.) The result is 


Gi (x —x')=—i Dd Voi, Quer’, t’) 


=i eh wir) dor’) fort—t’>0; 
: (75.12) 
G(x —x') =i d, bp a(t’, t-p, Ar, t) 


=i dye On, (ry) e’)  fort—t'<0. 
Pp 


For t >t' only the electron terms contribute to G, and for t <t’ only the positron 
terms. 

If the summation over p is replaced by an integration over d°p, a comparison of 
(75.12) and (75.11) shows that the integral 


i e-0" G(p) dpy (75.13) 


must have a phase‘factor e “” for 7 >0 and e“’ for +t <0. This can be achieved by 
passing above the pole po=e and below po=~—e in the plane of the complex 
variable po: 


One ee ee (75.14) 


For, when + >0, the path of integration is closed by an infinite semicircle in the 
lower half-plane, so that the value of the integral (75.13) is given by the residue at 
the pole pop = +e; when 7 <0, the path is closed in the upper half-plane, and the 
integral is given by the residue at the pole po=—e. The desired result is thus 
obtained in each case. 

This rule for avoiding the poles (Feynman’s rule) can be differently stated as 
follows: the integration is everywhere along the real axis, but the mass m of the 
particle is given an infinitesimal negative imaginary part: 


m—>m — 10. (75.15) 
We then have 
¢ > V[p?+(m — i0)] 


= V[p’+ m?— i0] 


=e—i0. 
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The poles po = +e are therefore moved off the real axis: 


—eE+i6 
e 


ge ee SS (75.16) 
0 e 
+e—-10 


and the integration along this axis is equivalent to integration along the path 
(75.14).+ Using the rule (75.15), we can write the propagator (75.10) in the form 


__ yp rm __ 1 


The rule of integration with displaced poles can be proved by means of the 
relation 


i 1. 

eT P a im75(x), (75.18) 
which 1s to be taken in the sense that multiplication by any function f(x) and 
integration gives 


ioe) 


[ a= [| ax - into (75.19) 


—o 


(the symbol P denoting the principal value). 
The Green’s function (75.10) is the product of the bispinor factor yp + m and a 
scalar, 


G(p) = 1/(p?- m’). (75.20) 


The corresponding coordinate function G(é) is evidently a solution of the 
equation 


(p> — m’)G (x — x') = 6% (x - x’), (75.21) 


i.e. it is the Green’s function of the equation (p?— m’*)ws =0. In this sense we can 
say that G(x — x’) is the scalar-particle propagator. It is easily seen by calculation, 
in the same manner as above, that the scalar field propagation function can be 
expressed in terms of the w-operators (11.2) by 


Gx — x’) = —i0|/T w(x) b*(x’)10), (75:22) 
which is analogous to the definition (75.1). The chronological product is defined (as 


+ It is useful to note that the rule for moving the poles corresponds to an infinitesimal damping of 
G(x — x') with respect to |r| =|t — t'|: if the value of po at the displaced poles is written as —(e — id) and 
+(e —id), with 6 > +0, then the time factor in the integral (75.13) becomes exp(—ie|t| — 6]r]). 
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for all boson operators) by 


Anh a On ee t(y! : od ee 
W(x) h(x) = Wx) p* (x2) | (75.23) 


= bi (x'\b(x), t<t’, 


with the same sign for both t >t’ andt <t’. 


§76. The photon propagator 


Hitherto we have been concerned (in §843 and 74) with the explicit form of the 
electromagnetic field operator A only in finding the matrix elements with respect to 
a change in the number of real photons. For this purpose it was sufficient to use the 
representation (§2) of the free field potentials in terms of transverse plane waves. 

This representation, however, does not give a complete description of every 
field, as is clear from the fact that the scattering diagrams (73.13), (73.14) must also 
take account of the Coulomb interaction of the electrons. The latter is described by 
the scalar potential ® and certainly cannot be reduced to an exchange between 
transverse virtual photons (describable by a vector potential such that div A = 0).T 

Thus we have as yet essentially no complete definition of the operators A, and 
without this it is impossible to carry out a direct calculation of the photon 
propagator by means of the formula 


D,, (x — x’) = i(O|TA, (x) A.(x')/0). (76.1) 


On the other hand, the fact that the potentials are not gauge-invariant deprives of 
much of their physical meaning the operators which would be needed for a 
complete quantization of the electromagnetic field. 

These difficulties, however, are purely formal, not physical, and can be avoided 
by using certain general properties of the propagator, which are evident from the 
requirements of relativistic invariance and gauge invariance. 

The most general 4-tensor of rank two which depends only on the 4-vector 
€=x-x’' Is 


Dww(E) = Su-D(E*) — 4,,DE), (76.2) 


where D and D" are scalar functions of the invariant é?.t This tensor is necessarily 
symmetrical. 


+ With the condition div A = 0, Maxwell’s equations lead to the following equations for A and ®: 
DA=—4aj+V se KO =H. 


In this gauge, the potential ® satisfies the static Poisson’s equation; cf. (76.13) below with Doo in the 
same gauge. 

+ These functions are different in the three ranges of values of the argument which are not mutually 
interchanged by Lorentz transformations: the regions outside the light cone (é’ <0), and within its two 
parts (£7 >0; & >0, & <0). 
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In the momentum representation, we correspondingly have 
D,,(k) = D(k’)g,, + k,k,D(k’). (76.3) 


where D(k’), D(k’) are the Fourier components of the functions D(é’), D(é’). 

The photon propagation function, in physical quantities (scattering amplitudes), 
is multiplied by the transition currents of two electrons, i.e. it appears in com- 
binations of the form (j")2:D,,(/")a3; see, for instance, (73.13). But, because of the 
conservation of current (d,j" =0), the matrix elements j2, = yy, satisfy the 
condition of 4-transversality, 


ki (j")2 = 9, (76.4) 


where k = p.—p,; cf. (43.13). It is therefore clear that all physical results are 
unchanged by the substitution 


Di Dig * Vike + Xk: (76.5) 


where the y, are any functions of k and ko. This arbitrariness in the choice of D,, 
corresponds to that in the field potential gauge. 

The arbitrary gauge transformation (76.5) can violate the relativistically in- 
variant form D,, assumed in (76.3) if the quantities y, do not make up a 4-vector. 
But, even considering only relativistically invariant forms of the propagator, we see 
that the choice of the function D‘(k’) in (76.3) is entirely arbitrary; it does not 
affect any physical results, and can be made in any convenient manner (L. D. 
Landau, A. A. Abrikosov and I. M. Khalatnikov, 1954). 

Thus the determination of the propagation function amounts to that of a single 
gauge-invariant function D(k’). If we take a given value of k’, and the z-axis in the 
direction of k, the transformations (76.5) will not affect the components D,, = Dy, = 
—D(k’). It is therefore sufficient to calculate the component D,,, using any gauge 
for the potentials. 

We shall use a gauge in which div A=0 and the operator A is given by the 
expansion (2.17), (2.18): 


A Ds ye . 
A> J (é,,e° a ™ + fe@* el), w = |kl; (76.6) 
k, a 


the index a = 1, 2 labels the polarizations. The only non-zero vacuum expectation 
values of products of the operators ¢, é* are (O|Cxa€ka|0) = 1. Then, by the definition 
(76.1), we have 


| 3 a 5 
Dix(€) = aay | *Be (> eMe De) eines. (76.7) 


where i, k are three-dimensional vector indices; the summation over k has been 
replaced by an integration over d°k/(27)°. The absolute value of + = t — t' appears in 


the exponent, because the operator product in (76.1) is chronological. 


QE4-U 
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It is evident from (76.7) that the integrand without the factor e'*'® is the 
component of the three-dimensional Fourier expansion of the function D,(r, t). 
For D,, = —D, it is 


(27rifw) e "| S |e? = (277i/w) e7@"", 


To find D,,(k*) we now have to represent this function as a Fourier integral over 
time. The appropriate formula is 


As explained in §75, this integration is understood to be taken along a contour 
passing below the pole ko = |k| = w and above the pole kp = —|k| = —w; for t > 0 the 
value of the integral is determined by the residue at the pole ky = +a, and for tT <0 
by that at ky = —wo. 

Thus we have finally 


D(k’) = 4a/(k* + i0). (76.8) 


The term +10 in the denominator which results from this proof is in accordance 
with the rule (75.15), i0 being subtracted from the (zero) mass of the photon. It is 
evident from (76.8) that the corresponding coordinate function D(€’) satisfies the 
equation 


~3,0"D(x —x') = 478 (x — x’), (76.9) 


i.e. it is the Green’s function of the wave equation. 
We shall generally take D“ = 0, i.e. use the propagation function 


4 
D,, = 8,D(k?) = ame gu (76.10) 


(the Feynman gauge). 
There are also other gauges which may be advantageous in certain applications. 
Putting D® = — D/k’, we obtain the propagator in the form 


4 kk, 
D,» = (Bm - “i | (76.11) 


(the Landau gauge), with D,,k” = 0. This choice is similar to the Lorentz gauge for 
potentials (A,k* = 0). 

The propagator gauge conditions Dyk' = 0, Dy k' = 0 are analogous to the three- 
dimensional gauge condition div A = 0 for the potentials. Together with 


D,, = —D = —4n/k’, 
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these conditions give 


p52 (5 -“'). (76.12) 


w* — k’ kK 


In order to obtain this D,;, we must apply to the propagator (76.10) the trans- 
formation (76.5), putting 


ee ea ee 
X0 2(w? = k’)k” Xi 2(w? °o k’)k? 


The remaining components D,, are then found to be 
Doo = —An/k’, Doi = 0. (76.13) 


This is called the Coulomb gauge (E. E. Salpeter, 1952). Doo is here the Fourier 
component of the Coulomb potential. 
Finally, the propagator gauge in which 


Di = - ara bi ~ <), Do: = Doo = 0, (76.14) 
w° —k Ww 
is analogous to the potential gauge condition ® = 0. This is a convenient form for 
use in non-relativistic problems (I. E. Dzyaloshinskii and L. P. Pitaevskii, 1959). 
All the above expressions relate to the momentum representation of the 
propagator. In some cases, it is convenient to use the mixed frequency—coordinate 
representation, i.e. the function 


D,,(@, r) = | D,,,(w, k) e™'* d?k/(27r)’. (76.15) 
In the Feynman gauge (76.10) 


D,,.(@, r) = 2,,D(o, r), 
where 
_ eik-r d°k 
es | wo —k’+i0 (27) 
_ i ; eikr = eikr 
aan re TT 


or, changing k to —k in the second term in the integrand, 


co 


i ek dk 
Mone | R40 


— 00 
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The integration here is carried out by closing the contour of integration with an 
infinite semicircle in the upper half-plane of the complex variable k, and amounts to 
taking the residue at the pole k = |w|+ i0. The final result is 


D(a, r) = — l ejlolr, (76.16) 


The following comment may be made regarding this expression. The process 
described by the diagrams (73.13) and (73.14) may be intuitively regarded as the 
scattering of electron 2 in the field due to electron 1 (or vice versa). The function 
(76.16) corresponds to the usual “retarded” potential « e'’” (see Fields, (64.1), 
(64.2)) only when w > 0. The sign of w, however, depends on the arbitrary choice of 
the direction of the arrow of k in the diagram. The above-mentioned property of 
D(w, r) signifies that in quantum electrodynamics the source of the field is to be 
regarded as the particle which loses energy, i.e. emits a virtual photon. 

To conclude this section, let us also consider the problem of the propagator for 
particles with spin 1 and non-zero mass. There is then no arbitrariness of the gauge. 
and the choice of the propagator is unambiguous. 

Substituting the w-operators (14.16) in the definition 


Guy = —iO|T Hh, (x) 7X90), (76.17) 


we obtain an expression that differs from (76.7) only in that the sum over 
polarizations in the integrand is replaced by 


UU: 
aQ 


Summation over polarizations is equivalent to averaging and multiplication by 3. 
the number of independent polarizations. Averaging gives the density matrix of 
unpolarized particles (14.15). Thus we find for the propagator of vector particles 


J PuP 
a _ v 
Gyu(P) =~ a er 7g Be» ~ PH"), (76.18) 


The propagators (75.17) and (76.18) have similar structures: the denominator 
contains the difference p*—m’, and the numerator is (apart from a factor) the 
density matrix of unpolarized particles with a given spin. 


§77. General rules of the diagram technique 


The calculation of the scattering matrix elements that has been given for some 
simple cases in §§73 and 74 contains all the fundamental features of the general 
method. There is no particular difficulty in deriving the corresponding general rules 
for calculating the matrix elements in any order of perturbation theory. 

As has already been mentioned, the matrix element of the scattering operator S 
for the transition between any initial and final states is equal to the vacuum 
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expectation value of the operator obtained by multiplying S$ on the right by the 
creation operators of all the initial particles and on the left by the annihilation 
operators of all the final particles. 

This treatment puts the S-matrix element in the following form in the nth order 
of perturbation theory: 


; it 
(f|S|i) as 71 0 ete babi, . -. Aig... Cig X 


x | d*x, lege d*x,T{b(— ieyA,)y;) a (Wn(— ieyAn)Wn $C ti ean aij aaah bij Pee (0); 
(77.1) 


the suffixes li, 2i,... label the initial particles (positrons, electrons and photons 
separately), and the ‘sunines LfiS2F.. . label the final particles. The suffixes pe a 

to the operators W and A signify that 1 = (x1) and so on. The operators & and A 
which appear here are linear combinations of the creation and annihilation opera- 
tors of the corresponding particles in various states. Thus we obtain expressions 
for the matrix elements which are the vacuum expectation values of the products 
of the particle creation and annihilation operators and of their linear combinations. 
The calculation of such expectation values is effected by means of the following 
results, which constitute Wick’s theorem (G. C. Wick, 1950). 

(1) The vacuum expectation value of the product of any number of boson 
operators ¢* and € is equal to the sum of the products of all possible expectation 
values of these operators taken in pairs (contraction). In each pair, the factors must 
be placed in the same order as in the original product. 

(2) For the fermion operators d*, d, b*, b (of the same or different particles), 
the rule is the same except that each term appears in the sum with positive or 
negative sign according to the parity of the number of interchanges of fermion 
operators needed to bring together all the operators that are averaged in pairs. 

The expectation value must obviously be zero unless the product contains a 
factor d*, b*, é* for each factor 4G, b, é. Then only pairs of operators (4, 4*),..., 
pertaining to the same states are to be contracted, and moreover only those pairs in 
which 4", etc., is to the right of d, etc.: the particle is first created and then 
annihilated (whereas (O|a‘“a|0) = 0, etc.). 

If each pair (@, G"), etc. appears only once in the product, Wick’s theorem is 
obviously true, the expectation value then reducing to a single product of pairwise 
expectation values. Its validity is also evident when all the annihilation operators in 
the product are to the right of the creation operators; this is called a normal 
product. The expectation value is then zero. Wick’s theorem is now easily proved 
by induction for the general case where one pair of operators appears k times in 
the product, as follows. 

Let us consider the expectation value (0|..cc*..|0), in which the pair of boson 
operators appears k times; the argument is entirely similar for fermion operators. If 
we interchange the factors ¢ and ¢" in one pair, the commutation rules give 


(O|..cc*..|0)=(O|..c*e..|0)+(O]..1.. |0). (77.2) 
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The expectation value (0|..1..|0) contains k—1 pairs, and Wick’s theorem is 
assumed to be valid for it. If the expectation value (0|..cc*..|0) is expanded by 
Wick’s theorem, it differs from (0|..c*c..|0) by just the term 


(O]..1.. {0)<O|cc*|0) = (O].. 1. . |0); 


in the expansion of (0|..c*c..|0), the corresponding term (0|..1. .|0)<0|c*c|0) is 
zero. Hence it follows from (77.2) that, if Wick’s theorem is valid for a matrix 
element (0|..c*c..|0), it is still valid when ¢ and ¢” are interchanged. Since the 
theorem is known to be valid for one particular order of factors (the normal order), 
it is therefore true in every case. 

Since Wick’s theorem is valid for products of operators 4, b,..., it is also true 
for all products which contain the linear combinations w, w, A of d, b,..., as well 
as the latter operators themselves. On applying this theorem to the matrix element 
(77.1), we bring it to the form of a sum of terms, each term being the product of a 
number of pairwise expectation values. The latter will include contractions of the 
operators w, w, A with “external” operators—those which create the initial parti- 
cles or annihilate the final particles. These contractions are expressed in terms of 
the wave functions of the initial and final particles by the formulae 


(0|Ac,|0) = Ap; (0|c,A|0) Ad» 
(0|ba5|0) = yp, (Olay 0) = We, (77.3) 
(0|b,4|0) = Wp, (O|ysb 50) = Wp, 


where A, and wp are the photon and electron wave functions with momentum p 
(the polarization indices are omitted for brevity, as in §§73 and 74). Contractions of 
the “‘internal’’ operators in the T product will also occur. Since the sequence of 
factors in each contracted pair is preserved when applying Wick’s theorem, the 
chronological sequence of operators is preserved in these contractions, and they 
are therefore replaced by the corresponding propagators.t 

Each term of the sum obtained from the matrix element by applying Wick’s 
theorem is represented by a particular Feynman diagram. In the nth-order diagram 
there are n vertices, each corresponding to one of the variables of integration (the 
4-vectors xX), X.,...). Three lines meet at each vertex, two being continuous 
(electron lines) and one broken (photon line); these correspond to the electron 
operators i and w and the photon operator A as functions of the same variable x. 
The operator wy corresponds to the incoming line and ws to the outgoing line. 

By way of illustration, we shall give some examples of the correlation between 
the terms of the matrix element in the third approximation and the diagrams. 


+ The following comment must be made regarding this last statement. In proving Wick’s theorem, 
we have made use of the commutation rules for the operators é and é", which are meaningful only for 
real (“transverse”) photons. The “external” operators é, & do in fact, of course, correspond to such 
(initial and final) photons, but the operators A (which appear within the T product) describe, as shown 
in §76, not only transverse photons. The situation here is similar to that in the calculation of D,, ($76). 
Owing to the relativistic and gauge invariance, it is sufficient to prove the theorem for those products 
(i.e. components of the tensors (O/TA,A,...|0)) which are determined by the transverse parts of the 
potentials. The theorem is then valid for all products. 
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Omitting the integral sign and the symbol T, and also the operator symbols, the 
factors —iey, and the arguments of the operators, we can symbolically write these 
terms as 


(a) (PAY) (DAd) (PAW) = veces Moses 


Be, af ST eed —|__.—___,_-— 
(b) (YAY) (YAY) (YAY) == | 
— ee ee 
(77.4) 
| | | 
z - s a 
(c) (pad) (VAY) (VAY) = ii satis 
a | 


For clarity, the electron and photon contractions are shown by continuous and 
broken lines as in the diagrams. The direction of the arrows from & to w for the 
electron contractions is the same as in the diagrams. For the internal photon 
contractions the direction is immaterial (the photon propagator is an even function 
of x —x’). 

The terms thus obtained include equivalent terms which differ only in that the 
vertices are renumbered, i.e. that the correlation between the vertices and the 
variables is changed, or simply that the variables of integration are renamed. The 
number of such interchanges 1s n!; this cancels the factor 1/n! in (77.1), and there is 
then no need to consider diagrams differing only by interchange of vertices. This 
has already been noted in §§73 and 74. For example, there are two equivalent 
diagrams in the second approximation: 


p< (77.5) 


(DAdv) (PAY) = 
[a | 


In (77.4) and (77.5) only internal contractions which correspond to internal 
diagram lines are shown (virtual electrons and photons). The operators still free are 
contracted with external operators, and this establishes a correlation, between the 
free ends of the diagrams and certain initial and final particles. Then w& (contracting 
with operators 4d; or b;) gives the final electron line or the initial positron line, and 
+ (contracting with ad? or b;) gives the initial electron line or the final positron line. 
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The free operator A (contracting with ¢} or ¢;) can correspond to either an initial 
or a final photon. Thus we obtain sets of several topologically identical diagrams 
(i.e. diagrams having the same number of lines arranged in the same way), differing 
only by interchanges of initial and final particles between incoming and outgoing 
free ends. 

Each such interchange is clearly equivalent to a certain interchange of the 
external operators 4, b,... in (77.1). It is therefore evident that, if the initial 
particles or the final particles include identical fermions, diagrams which differ by 
an odd number of interchanges of free ends must have opposite signs. 

An uninterrupted sequence of continuous lines in the diagrams constitutes an 
electron line along which the arrows maintain a constant direction. Such a line may 
have two free ends or form a closed loop. For example, the diagram 


Way ay = sO 


has a loop with two vertices. The maintenance of direction along the electron line is 
the graphical expression of the conservation of charge: the “incoming” charge at 
each vertex is equal to the “outgoing” charge. 

The arrangement of the bispinor indices along the continuous electron line 
corresponds to writing the matrices from left to right in motion contrary to the 
arrows. The bispinor indices of different electron lines can never become confused. 
Along an open line, the sequence of indices terminates at the free ends with 
electron (or positron) wave functions; in a closed loop, the sequence of indices 1s 
itself closed, and the loop corresponds to the trace of the product of the matrices 
found on it. This trace must be taken with negative sign, as is easily seen. A loop 
with k vertices corresponds to a set of k contractions: 


nn en | 


GaGa GAY 


(or to another which is equivalent, differing only in an interchange of the vertices). 
In the (k — 1)th contraction the operators i and wW are together in the order (wb to 
the right of W) i in which they must appear in the electron propagator. The operators 
at the ends are brought together by an even number of interchanges with other 
w-operators, and are then in the order wi. 

Since 


(O[T ep’ y|0) = — (0|Tyxp'|0) 


(see the second footnote to $74), the replacement of this contraction by the 
corresponding propagator means a change in the sign of the whole expression. 

In general, the change to the momentum representation is made in an exactly 
similar manner to that in §§73 and 74. As well as the general law of conservation of 
4-momentum, “‘conservation laws’’ must also be satisfied at each vertex. But all 
these laws may not suffice to determine uniquely the momenta of all the internal 
lines in the diagram. In such cases, there remain integrations over d‘p/(27r)‘ for all 
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the undetermined internal momenta; these integrations extend throughout p-space, 
including po from — to +. 

In the above discussion it has been assumed that the perturbation is represented 
by the interaction between those particles which are ‘“‘actively” concerned in the 
reaction (i.e. between particles whose state is altered as a result of the process). A 
similar treatment can be given for the case where there is an external electromag- 
netic field, i.e. a field generated by “‘passive”’ particles, whose state is not altered in 
the process. 

Let A(x) be the 4-potential of the external field. It appears in the Lagrangian 
of the interaction together with the photon operator A, as the sum A+ A® (which 
is multiplied by the current operator j). Since A does not involve any operators, it 
cannot contract with other operators. Thus only external lines in Feynman 
diagrams can correspond to an external field. 

If A“ is expressed as a Fourier integral: 


AMO) = | AM) d'glany, 
(77.6) 
A(q) = | A(x) e'™ d*x, 


the expressions for the matrix elements in the momentum representation will 
contain the 4-vector q together with the 4-momenta of other external lines 
corresponding to real particles. Each such external-field line can be correlated with 
a factor A“(q), and the line is to be regarded as ‘“‘incoming”’ (in accordance with 
the sign of the exponent in the factor e ‘” which accompanies A“(q) in the Fourier 
integral; an “‘outgoing line’? would be correlated with a factor A*(q)). If the 
4-momenta of all the external-field lines are not uniquely defined (for given 
4-momenta of all the real particles) by the law of conservation of 4-momentum, 
then there remain integrations over d‘q/(27)* for all the ‘“‘free’’ q and over all the 
other undetermined 4-momenta of the diagram lines. 

If the external field is independent of time, then 


Aq) = 2775(q*)A“(Q), (77.7) 


where A‘(q) is the three-dimensional Fourier component: 
A(q) = | Ap) ei By, (77.8) 


In this case the external line is correlated with A“(q) and assigned a 4- 
momentum q” = (0, q); the energies of the electron lines which (together with the 
field line) meet at a vertex will be equal by virtue of the conservation law. 
Integration over d*p/(27)> is necessary for the other “free” three-dimensional 
momenta p of the internal lines. The amplitude M,;; thus calculated determines, for 
example, the scattering cross-section by (64.25). 

We may give a list of final rules for the diagram technique whereby an 
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expression may be obtained for the scattering amplitude (or rather for iM;;) in the 
momentum representation. 

(1) The nth approximation of perturbation theory corresponds to diagrams with 
n vertices, each of which is the meeting point of one incoming and one outgoing 
electron line (continuous) and one photon line (broken). The amplitude of the 
scattering process involves all the diagrams having free ends (external lines) equal 
in number to the initial and final particles. 

(2) Each incoming continuous external line is associated with the amplitude 
u(p) of an initial electron or u(—p) of a final positron (where p is the 4-momentum 
of the particle). Each outgoing continuous line is associated with the amplitude 
u(p) of a final electron or &(—p) of an initial positron. 

(3) Each vertex is associated with a 4-vector —iey”". 

(4) Each incoming broken external line is associated with the amplitude 
V(47)e, of an initial photon, and each such outgoing line with the amplitude 
V(47)e* of a final photon, where e is the polarization 4-vector. The vector index p 
is the same as the index of the matrix y* at the corresponding vertex, so that the 
scalar product ye or ye* is obtained. 

(5) Each continuous internal line is associated with a factor iG(p), and each 
broken internal line with a factor —iD,,(p). The tensor indices p, v are the same as 
the indices of the matrices y", y” at the vertices joined by the broken line. 

(6) The arrows have a constant direction along any continuous sequence of 
electron lines, and the arrangement of the bispinor indices along them corresponds 
to writing the matrices from left to right in motion contrary to the arrows. A closed 
electron loop corresponds to the trace of the product of the matrices found on it. 

(7) At each vertex, the 4-momenta of the lines which meet there satisfy a 
conservation law, i.e. the sum of the momenta of the incoming lines is equal to the 
sum of the momenta of the outgoing lines. The momenta of the free ends are given 
quantities (subject to the general conservation law), with momentum —p assigned 
to the positron line. Integration over d“p/(27)‘ is carried out for the momenta of 
internal lines which remain undetermined after application of the conservation laws 
at every vertex. 

(8) An incoming free end corresponding to an external field is associated with a 
factor A“(q); the 4-vector q is related to the 4-momenta of the other lines by the 
conservation law at the vertex. If the field is constant, the line is associated with a 
factor A®(q), and integration over d’p/(27)* is carried out for the three-dimen- 
sional momenta of internal lines which remain undetermined. 

(9) An additional factor —1 1s included in iM,; for each closed electron loop in 
the diagram and for each pair of positron external lines if these are the beginning 
and end of a single sequence of continuous lines. If the initial particles or the final 
particles include more than one electron or positron, the diagrams differing by an 
odd number of interchanges of identical particles (i.e. of the corresponding external 
lines) must have opposite signs. 

To clarify the last rule, it may be added that diagrams having the same 
continuous lines, i.e. diagrams which would be identical after removal of all photon 
lines, must always have the same sign. When identical fermions are present, the 
sign of the amplitude as a whole is arbitrary. 
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§78. Crossing invariance 


The representation of the scattering amplitudes M,;; by Feynman integrals 
reveals the following noteworthy symmetry property of these amplitudes. 

Any of the incoming external lines in a Feynman diagram may be regarded 
(without changing the direction of its arrow) as either an initial particle or a final 
antiparticle, and any outgoing line as either a final particle or an initial antiparticle. 
When the change is made from particle to antiparticle, the significance of the 
4-momentum p assigned to the line also changes: p = p, for the electron (say), and 
p =~—p, for the positron. The polarization assigned to the particle is also changed. 
Since an incoming external line must correspond to an amplitude u and an outgoing 
one to u*, we have u=u, for the electron and u =u for the positron; and the 
change from u to u* signifies a change in the sign of the spin component (or the 
helicity) of the particle. 

For the photon, a strictly neutral particle, this is simply a change from emission 
to absorption or vice versa: an external photon line with momentum k corresponds 
either to the absorption of a photon with momentum k, = k, or to the emission of a 
photon with momentum k, = —k and the opposite helicity. 

This change in the significance of the external lines is equivalent to a change 
from one cross-channel of the reaction to others. Hence it follows that the same 
amplitude, as a function of the momenta of the free ends of the diagrams, describes 
every channel of the reaction.t Only the meaning of the arguments of the function 
varies with the channel: the change from particle to antiparticle implies p; > —p,, 
where p; is the 4-momentum of the initial particle (in one channel) and p; the 
4-momentum of the final particle (in the other channel). This property of the 
scattering amplitude is called crossing symmetry or crossing invariance. 

In terms of the invariant amplitudes defined in §70 as functions of the kinematic 
invariants, we can say that these functions will be the same for all channels, but for 
each channel their arguments will take values in the corresponding physical region. 
Thus the Feynman integrals determine the invariant amplitudes as analytic func- 
tions; their values in the various physical regions are the analytical continuation of 
a function specified in one region. Since the integrands in the Feynman integrals 
have singularities, so do the invariant amplitudes, and their singularities can be 
determined from the expressions for the integrals, using the rule of pole avoidance. 
If the invariant amplitudes are calculated for any one channel from the Feynman 
integrals, their analytical continuation to the other channels will necessarily take 
account of these singularities. 

It should be emphasized that crossing invariance goes beyond the properties of 
the scattering matrix which follow from the general requirements of space-time 
symmetry. The latter imply the equality of amplitudes for processes which differ by 
the interchange of initial and final states and the replacement of all particles by 
antiparticles (with the momenta p of all particles unchanged and the signs of their 
angular momentum components reversed). This is the condition of CPT in- 


+ If a particular channel is forbidden by the conservation of 4-momentum, the transition probability 
is necessarily zero because of the delta function which appears as a factor in (64.5). 
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variance.{ Crossing invariance, however, allows this transformation not only for all 
the particles at once but also for any one particle. 


§79. Virtual particles 


The internal lines in the Feynman diagrams play a role in invariant perturbation 
theory analogous to that of the intermediate states in the “‘ordinary” theory, but the 
nature of these states is different in the two theories. In the ordinary theory the 
(three-dimensional) momentum is conserved in the intermediate states, but the 
energy is not, and for this reason they are said to be virtual states. In the invariant 
theory, the momentum and the energy appear on an equal footing: in the inter- 
mediate states, the whole 4-momentum is conserved (this results from the fact that 
the integration in the S-matrix elements is over both coordinates and time, thus 
ensuring the invariance of the theory). But the relation between energy and 
momentum which holds for real particles and is expressed by the equation p* = m’ 
is no longer satisfied in the intermediate states, which are therefore spoken of as 
intermediate virtual particles. The relation between the momentum and energy of a 
virtual particle may be anything required by the conservation of 4-momentum at 
the vertices. 

Let us consider a diagram consisting of two parts I and II, joined by a single 
line. Ignoring the internal structure of these parts, we can represent the diagram in 
the schematic form 


(79.1) 


(the lines shown may be either continuous or broken lines). By the general 
conservation law, the sums of the 4-momenta of the external lines for parts I and II 
are equal. Because of the conservation at each vertex, they are also equal to the 
4-momentum p of the internal line joining parts I and II. Thus this momentum is 
uniquely defined, and there is therefore no integration with respect to it in the 
matrix element. 

The quantity p* may be either positive or negative, depending on the reaction 
channel. There is always a channel in which p?>0.+ Then the virtual particle is 
entirely analogous, as regards its formal properties, to a real particle with real mass 
M= Vp’. Its rest frame can be defined, its spin determined, and so on. 

The tensor structure of the photon propagator (76.11) is the same as that of the 
density matrix of an unpolarized particle with spin 1 and non-zero mass: 


Puv = (Sur >. Pup,/m’) 


+ The formal description of the change from one of these reactions to the other by reversing the 
signs of all the 4-momenta in the Feynman diagrams corresponds to the significance of the operation 
CPT as 4-inversion. 

+ For example, the channel (if it is allowable on energy grounds) in which all the free ends of part I 
correspond to initial particles and those of part II to final particles. Then p =P; (the sum of the 
4-momenta of all the initial particles), and in the centre-of-mass system p = (P}, 0), so that p’?>0. 
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(see (14.15)). For a virtual particle the propagator (a quantity obtained from a 
quadratic combination of the field operators) plays a role analogous to that of the 
density matrix for a real particle. Thus a virtual photon, like a real photon, must be 
assigned spin |. But, unlike the two independent polarizations of the real photon, 
all three polarizations are possible for the virtual photon, which is a “‘particle”’ with 
finite mass. 

The electron propagation function 1s 


Ga«ypt+m, 


where m is the mass of the real electron, the “‘mass”’ of the virtual particle being 
M = Vp’. Putting 


M+m M-m 
5M (yp + M)+ M 


yp +m = (yp — M), (79.2) 


we see that the first term corresponds to the density matrix of a particle with mass 
M and spin 3, and the second term to that of a similar ‘“‘antiparticle’’; cf. (29.10) and 
(29.17). Since the particle and the antiparticle have different internal parities ($27), 
we conclude that the same spin 3 must be assigned to the virtual electron, but that 
no definite parity can be assigned to it. 

A characteristic feature of the diagram (79.1) is that it can be cut into two 
unconnected parts by dividing only one internal line.t This line corresponds, in 
such cases, to a one-particle intermediate state, i.e. a state having only one virtual 
particle. The scattering amplitude corresponding to such a diagram contains the 
characteristic factor (which does not undergo integration) 


p>—m?+i0’ 


arising from the internal line p; m is the electron mass for an electron line and zero 
for a photon line. Thus the scattering amplitude has poles at the values of p for 
which the virtual particle would become a physical one (p* = m’). This situation is 
similar to the one in non-relativistic quantum mechanics, where the scattering 
amplitude has poles for energy values corresponding to bound states of the system 
of colliding particles (QM, §128). 

Let us consider the diagram (79.1) for the reaction channel in which all the free 
ends on the right correspond to initial particles, and all those on the left to final 
particles; then p’>0. Then we can say that, in the intermediate state, all the initial 
particles are converted into one virtual particle. This is possible only if such a 
conversion would not contradict the necessary conservation laws (not including the 
conservation of 4-momentum), namely the conservation of angular momentum, 
charge, charge parity, etc. This is the necessary condition for the occurrence of 
what are called pole diagrams. If these exist for one reaction channel, they exist 
also for the remaining channels, because of crossing invariance. 


+ This property occurs for the diagrams of almost all processes in the first non-vanishing ap- 
proximation. 
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For example, the conservation laws mentioned do not preclude the formation of 
a virtual electron by e+ y-—e. This corresponds to a pole of the Compton effect 
amplitude (and therefore a pole of the other channel of this reaction, namely 
two-photon annihilation of an electron—positron pair). The formation of a virtual 
photon by e +e*-—y corresponds to a pole of the amplitude for the scattering of 
an electron by a positron, and therefore that of an electron by an electron. Two 
photons can give neither a virtual electron nor a virtual photon: the conversion 
y+y-—e is forbidden by the conservation of charge or angular momentum, and 
yt+ty-—y by that of charge parity. Accordingly, the photon—photon scattering 
amplitude cannot involve pole diagrams. 

The origin of the pole singularities of the scattering amplitudes, which has been 
discussed above on the basis of Feynman integrals, is really more general and is 
not dependent on perturbation theory. We shall show that such singularities arise 
simply as a consequence of the unitarity condition (71.2). 

Let us assume that the intermediate states n which appear in (71.2) include a 
one-particle state. The contribution of this state is 


3 
(Tr- TH” = in) S | 8° ,— pT aTA BAS. 
r (277) 


where p and A are the 4-momentum and helicity of the intermediate particle. The 
integration over d’p is replaced by one over d“p (in the range py = € > 0): 


d>p >2&8(p*— M’) d‘p, 


where M is the mass of the intermediate particle. The integration eliminates the 
delta function 6(P; — p); we then change from the amplitudes Tj to Mj by (64.10), 
obtaining 


(M,; — M#)°"?) = 27i8(p?— M2) © M;,M&. (79.3) 
A 


Assuming T and P invariance, we have (apart from a phase factor) Mis = Mf, 
where the states i’, f’ differ from i, f only in the sign of the particle helicities (with 
the same momenta). Taking the sum of equation (79.3) and the corresponding 
equation for M;;,— M#;, we have 


im M 9") = —@5(p*— MR, (79.4) 


where 


M 5 = Mg + Mp, 


R =— >) (Myp.M * + My,M3,). 


rX 


Hence it follows that M,, as an analytic function of p?= P?= P}, has a pole at 
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p’=M°. According to (75.18) the pole part is 


R 


M pre) =~ 


Real transitions to a one-particle state are possible only for one value of P; = Pj, 
namely M’. Thus we in fact obtain the scattering amplitude structure correspond- 
ing to a diagram of the form (79.1). 

Finally, let us consider an important property of diagrams containing closed 
electron loops. This property is easily derived by applying the concept of charge 
parity to a virtual photon: a virtual photon, like a real photon, must be assigned a 
definite (negative) charge parity.t 

If a diagram contains a closed loop (with number of vertices N > 2), the 
amplitude for the process concerned must include not only that diagram but also 
another which differs only in the direction of traversal of the loop Gif N = 2, there 
is evidently no distinguishable “direction of traversal’). If these loops are “‘cut 
out” along the broken lines which come to them, we obtain two loops, I]; and [y: 


i a aca (79.6) 


which may be regarded as diagrams determining the amplitude for the process of 
conversion of one set of photons (real or virtual) into another; the number N is the 
sum of the numbers of initial and final photons. But the conservation of charge 
parity forbids the conversion of an even number of photons into an odd number. 
When N is odd, therefore, the sum of the expressions corresponding to the loops 
(79.6) must be zero. The total contribution to the scattering amplitude from two 
diagrams containing these loops as constituent parts is consequently zero also, a 
result known as Furry’s theorem (W. H. Furry, 1937). 

Thus, in constructing the amplitude for a given process, we can ignore diagrams 
containing loops with an odd number of vertices. 

This cancellation of diagrams occurs for the following reason. A closed electron 
loop corresponds to an expression (with given momenta ki, k2,...,ky of the 
photon lines) 


[ at -ulyenG@ryedG@ +k). (79.7) 


where p,p +ki,... are the momenta of the electron lines (which are not completely 
determined after the conservation laws have been applied at the vertices). Let the 
operation of charge conjugation be applied to all matrices y* and G, replacing them 


+ This follows from the same arguments as were given at the end of $13 for a real photon, 
concerning the electromagnetic interaction operator acting at each vertex. 
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by Uc'y"Uc and Uc'GUc. The expression (79.7) is then unchanged, since the trace 
of a product of matrices is unaffected by such a transformation. According to 


(26.3), 


Uely"Uc = ~ 94, (79.8) 
and hence 
Ue'G(p)Uc = EX = G-p). (79.9) 


But the replacement of G(p) by the transposed matrix with the sign of p changed 1s 
clearly equivalent to a change in the direction of traversal of the loop, all the 
arrows being reversed. Thus this transformation changes one loop into the other, 
and there is a factor (-1)" from the change (79.8) at each vertex. Hence 


fl, = (- 1). On, (79.10) 


i.e. the contributions from the two loops are the same when the number of vertices 
is even, but equal and opposite when this number is odd. 


CHAPTER IX 


INTERACTION OF ELECTRONS 


§80. Scattering of an electron in an external field 


ELASTIc scattering of an electron in a constant external field is a simple process 
which occurs even in the first approximation of perturbation theory (the first Born 
approximation). It corresponds to a diagram with one vertex: 


Va 


ie (80.1) 


p p 


where p and p’ are the initial and final 4-momenta of the electron, and g = p'— p. 
Since the electron energy is conserved in scattering in a constant field (e = e'), we 
have q = (0, q).? 

The corresponding scattering amplitude is 


Mj: = — eti(p')[yA(q)]u(p), (80.2) 


where A“(q) is the component of the spatial Fourier resolution of the external 
field. The scattering cross-section is, according to (64.26), 


_ 1 A { ne ! 
For an electrostatic field, A = (A§”, 0), and hence 


Myi = — eit(p')y’u(p)Ap(q) 


— eu*(p')u(p)Ap°(q). (80.4) 


In the non-relativistic case, the bispinor amplitudes u(p) of the plane waves reduce 
to the non-relativistic (two-component) amplitudes. For scattering without change 
of polarization, u'’= u, and u*u = 2m by the normalization condition chosen. Thus 


2 
do = | =e: va) do’, 


+ When there is an external field, such a diagram is, of course, not forbidden by the law of 
conservation of 4-momentum, as the diagram (73.19) with a real photon was: q’, unlike the square of the 
4-momentum of a real photon, need not be zero, and the component with the necessary q Is 
automatically taken from the Fourier integral which represents the external field. 
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where U(q) = eA§(q) is the Fourier component of the potential energy of the 
electron in the field; this expression is the same as the familiar Born’s formula 
(QM, (126.4)). 

In the general relativistic case, the cross-section for scattering of unpolarized 
electrons is obtained by averaging |M,)|° over initial polarizations and summing over 
final polarizations, i.e. by taking the quantity 


; > |Myil*, 


polar. 


where the summation is over the spin directions of the initial and final electrons; 
the factor 3 changes one of these summations into an averaging. According to the 
rules given in $65, we obtain 


>> |Myil’ = 2 tr p'(yA®)p(yA*) 


polar. 


= 3|A)(q)|’ tr (m + yp’) y°(m + yp)y°. 


To calculate the trace, we note that y(yp)y°=(yp), where p =(e,—p), and 
therefore 


atr(m + yp')y°(m + yp)y° =i tr(m + yp')(m + yp) 
=m* + p'p 
= e*+m*+ p:p 
= 2¢*—I¢q°. 
Hence the cross-section is 


2 (e) 2 2 
dg NT oj = ay (80.5) 
At Ag 


For a field due to a static distribution of charge with density p(r), we have 


Ay’(q) = 4ap(q)/q’, (80.6) 
where p(q) is the Fourier transform of the distribution p(r) (the form factor). In 


particular, for the Coulomb field of a point charge Ze we have p(q)= Ze. The 
cross-section is then 


I eee 2 
do = doe (1 - 7) (80.7) 


(N. F. Mott, 1929). The quantity q’ = 4p’ sin’ 30, where @ is the scattering angle. The 
angular dependence of the quantity preceding the parenthesis is therefore that of a 
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Rutherford cross-section: 


2\2..2 
dan = do SEEPS 


2\2..2 
inva or sin“ 10: (80.8) 


in the non-relativistic limit, e?/p*— 1/m’v*. Thust 
do = dor, (1 — v* sin’ 36). (80.9) 
In the ultra-relativistic case, the angular distribution differs from the non-relativis- 


tic case in that there is much less backward scattering: as 09> 7, da/dap,—> m’/e?. 
In the ultra-relativistic case, formula (80.7) gives for small-angle scattering 


do’. (80.10) 


Although this formula has been derived in the Born approximation (1.e. on the 
assumption that Ze’ <1), it remains valid (for angles 6 < m/e) even if Ze’ ~ 1. This 
can be seen by using the ultra-relativistic wave function #$) (39.10), which is exact 
as regards Ze’. This solution, which is valid in the range (39.2), of course remains 


valid in the asymptotic range r->~. Here 


a-VF 


€ 


~1—cosd~@’°<1, 


F ~ 1+ constant x e'?" ?'”, 


so that the correction term remains small, as it should. The wave function of the 
form e'?'' F, which has the same form as the non-relativistic function (with an obvious 
change of parameters), has the same asymptotic expression, and therefore the 
cross-section is given by the Rutherford formula. 

To calculate the scattering cross-section for electrons with any polarization, we 
could use the density matrix (29.13), following the general procedure. In this case, 
however, the result can be more readily obtained by expressing the bispinor 
amplitudes u(p’) and u(p) in the form (23.9). Multiplication gives 


u*(p')u(p)=w'*{e+m+(e—m)n'-o)(n- o)}w, 
or, using (33.5), 


u*(p')u(p)= w’* fw, (80.11) 


+ The difference between do and dog, shown by this formula is specific to particles with spin 3. In 
the scattering of particles with spin 0, if their motion in the electromagnetic field is described by the 
wave equation, the result is do = dogu. At first sight it might appear puzzling that the factor expressing 
this purely quantum effect does not contain f. However, it must be remembered that the 
condition for the Born approximation to be valid (e*/hv <1) is contrary to the condition for quasi- 
classical motion in a Coulomb field, and therefore formula (80.9) cannot be taken to the classical limit. 
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wheret 


f=A+Bv-a, 


A=(e+m)+(e—m)cos 0, 
(80.12) 
B= —i(e—m)sin 0, 


vy=nXn’'/sin 86. 


The two-component quantity (three-dimensional spinor) w is the non-relativistic 
spin wave function of the electron. The change to the partially polarized states is 
therefore made by replacing the products w.w% (where a, B are spinor indices) by 
the non-relativistic two-rowed density matrix p,g. Thus we must put 


|M,i|’ > e?|Ab(q)/ tr p(A — Bv- a)p'(A+ Bv-o), 
where 
p=xl+o-0, p'=1l+o-f), 


and ¢, ¢ are the vectors of the initial polarization and the final polarization selected 
by the detector. The result of calculating the trace is 


do = dro} lA BID Es Gr ZB Du ne) 28 Bie bE ed (80.13) 


A’+|BP 


where dap» is the scattering cross-section for unpolarized electrons. 

Expressing the quantity in the braces in (80.13) in the form {1+ (C”-¢}, we find 
the polarization ¢” of the final electron itself, as opposed to the detected 
polarization £’ (see §65):+ 


pn —(A’= BY’)C+2|B’w-Ovt+2AlBly xf (80.14) 


A’+|B/’ 


We see that the scattered electrons are polarized only if the incident electrons are 
polarized. This is a general property of the first Born approximation; cf. QM, $140. 
In the non-relativistic case (e > m), (80.14) gives C” = , i.e. the electron retains 
its polarization on scattering, a natural consequence of the neglect of spin—orbit 
interaction. 
In the opposite (ultra-relativistic) case, we have 


A= e(1+cos 6), B = —ie sin 0, 
in accordance with the general formula (38.2). 


+ The definition of f used here differs by a factor from that in §§37 and 38. 
+t Formula (80.14) corresponds to that derived in QM, §140, Problem 1, and is obtained from it by 
taking A real and B imaginary. 
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If the incident electron has a definite helicity (C= 2An, A = +3), (80.14) gives 
after a simple calculation 


CO = 2)rn’. 


Thus the electron remains helical after scattering, with the same value (A) of the 
helicity. 

This property occurs because, as already mentioned in §38, when the mass is 
neglected Dirac’s equation in the spinor representation separates into two in- 
dependent equations for the functions € and 7. The result has also a more general 
significance, since the current 


}=(E*Ety*n, E*oE-n*on), 


and therefore the electromagnetic perturbation operator V = ejA, do not contain 
mixed terms in € and », and thus have no matrix elements for transitions between & 
states and 7 states. Hence it follows that, if an ultra-relativistic electron has a 
definite helicity (i.e. if either y» or € is non-zero), this helicity is conserved in 
interaction processes in an approximation corresponding to completely neglecting 
the electron mass. 


381. Scattering of electrons and positrons by an electron 


Let us consider the scattering of an electron by an electron, in which two 
electrons with 4-momenta p,, p2 collide and emerge with 4-momenta pj, p3. The 
conservation of 4-momentum is expressed by 


Pit po= pit p>. (81.1) 
We shall use the kinematic invariants of 866, defined by 
S =(pi+ pr) = 2(m’ + pip), 
t = (p1— pi) = 2(m*— pip)), 


u = (pi — p3) = 2(m?— pip3), 
st+tt+u=4m’. 


(81.2) 


The process in question is represented by the two Feynman diagrams (73.13), 
(73.14), and its amplitude ist 


I FF; as tal J Vv keel f 
Mj = 4ne}s (Usy" Ur) (UYU) — 7 (uy us\isy.u)f (81.3) 


+ This form of My is in accordance with the general expression (70.5). In the first non-vanishing 
approximation of perturbation theory, only one of the five invariant amplitudes is non-zero: f3(t, u) = 
4re‘/t. 
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According to the rules given in §65 for the states of initial and final particles 
described by polarization density matrices pj, pj,..., we make the change 


1 f Vv / 1 / Vv / 
Myi|" > lone'| tr (p2y"poy”) tr (pi YuPry) +73 tr (piy"p2y”) tr (p2yup1yv) — 
1 a voy I ae Pat 
— Fy Ce2y"e2y"piyupry) — Fo tr (piy"p2y'p2Y.Pry)(- (81.4) 


For the scattering of unpolarized electrons (without regard to their polarization 
after scattering), we must put for all the density matrices p =x(yp +m), and 
multiply the result by 2 x 2 =4 (averaging over the polarizations of the two initial 
electrons, and summation over the polarizations of the two final electrons). The 
scattering cross-section is given by formula (64.23), in which, by (64.15a), I? = 
48(s —4m’). It may be written 


4 
dor = dt ATs (F(t, u) + a(t, u) + flu t) + eu, Ob 
f(t, u) = i tr [(yp2t+ m)y"(yp2t+ m)y")] tr ((ypit+ m)y.(ypitm)y,],7 (81.5) 


i / Vv i 
g(t,u) = Téetu tt [(yp2+m)y"(yp2+ m)y"(ypit+ m)y,(ypit m)y,]. 


In f(t, u) the traces are first calculated (using (22.9), (22.10)), followed by sum- 
mation over p and v;t in g(t, u) the summation over p and v is taken first, using 
formulae (22.6). The result is 


2 
f(t, u) ==3 [(pip2)’ + (pips) + 2mm? — pip))I, 
t 
2 2 
g(t, u) =F (pip2— 2m )(P 1P2), 
or, in terms of the invariants (81.2), 
F(t, w) = Ls? + w2) + amt — m?)], 
(81.6) 


g(t, u) = gu, t) == Gs —m?)4s — 3m’), 


+ The following formula is given for future reference: 


4 tr (ypi+m)y"(yp2t m)y” = g""(m?— pipr) + pp + pips. 
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Thus the cross-section 1s 


_ yan att i722 2 OR ed 
0 Tere Wana) 2 Ls +u‘)+4m (t m~)| + 
< [s?+ 1)+4m(u— m?)] -2 Gs — m’)Qs — 3m?)}, (81.7) 


where r, = e”/m. 
In the centre-of-mass system, we have 
s=4e°, t=-—4p’sin30, u=—4p’ cos” 36, | cos 
— dt = —2p’d cos 6 =(p’/7) do, | ) 
where |p| and e are the magnitude of the momentum and energy of the electrons, 


which are unchanged in the scattering, and @ is the scattering angle. In the 
non-relativistic case (« ~ m),t we obtain for the cross-section 


4 
= rm (o 23-5) 
=( e° ) ( 1 : 1 1 ) a 
~ \mv2/) \sin‘4@  cos‘l@  sin?46@ cos*1@/ °° 


2 \2 2 
= (= at eos) do (non-relativistic), (81.9) 


mv? sin’ 6 


where v= 2p/m is the relative velocity of the electrons, in accordance with the 
non-relativistic theory (see QM, $137). In the general case of arbitrary velocities, 
formula (81.7) with the substitution (81.8) can easily be brought to the form 


27.21 22 2 \2 
_ 29m (e +p’) 4 3 ( Pp ( 4 )] 
BOr> Me Ap*e° sin’@ sin’?@ \e2+ p ry sin’ 6 a Ch 


(C. Mgller, 1932). In the ultra-relativistic case (p’ ~ €”), 


2 2 2 

_ 2m‘ (3+cos’ 6) saacigene 

do =r: a eg do (ultra-relativistic). (81.11) 

In the laboratory system, where one of the electrons (say electron 2) is at rest 
before the collision, the cross-section can be expressed in terms of the quantity 


Aa Stee (81.12) 


+ The velocity v is assumed small (v <1) but such that the condition for perturbation theory to be 
applicable is still satisfied: e’/v (=e7/hv) <1. 
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the energy (in units of m) transferred by the incident electron (electron 1) to 
electron 2.+ The invariants are 


s =2m(m + €)), t= —2m7A, u= —2m(e,—m—maA). (81.13) 


Substitution of these expressions in (81.7) gives the following formula for the 
energy distribution of the secondary electrons (called 6 electrons) formed in the 
scattering of fast primary electrons: 


Pome 2 
do 20 ay lf (81.14) 
where y = €,/m. The quantities mA and m(y — 1-— A) are the kinetic energies of the 
two electrons after the collision; the identity of the two particles is shown here by 
the symmetry of the formula with respect to these quantities. If the term ‘“‘recoil 
electron” is arbitrarily applied to the electron with the smaller energy, A takes 
values from 6 to 3(y — 1). When A is small, formula (81.14) becomes 


y dA _2aredA 


do =2nr. y-le op & 


A<y-1. (81.15) 


This formula, if expressed in terms of the velocity of the incident electron 
(v; = |pi|/e1), retains the same form in the non-relativistic case. Its form is naturally, 
therefore, the same as that of the result given by the non-relativistic theory (cf. 
QM, (148.17)). 

Let us now consider the scattering of a positron by an electron (H. J. Bhabha, 
1936). This is another cross-channel of the same general reaction as the electron— 
electron scattering. If p_, p. are the initial momenta of the electron and positron, and 
p',p: their final momenta, the change from one case to the other is made by the 
substitutions 


POPs PP Pea py Pa pe. 
The kinematic invariants (81.2) become 
s=(p-—piy, t=(pi-pi)’, u=(p-t+p.y. (81.16) 


If ee scattering is the s channel, ée scattering is the u channel of the reaction. The 
square of the scattering amplitude, expressed in terms of s, t and u, remains as 
before; in the denominator of (81.5), s must be replaced by u. Thus the cross- 
section for scattering of a positron by an electron is, instead of (81.7), 


) 4am?’ dt 


Cos u(u — 4m°) 


17 [(s* + u’?)+4m°*(t — m*)] + 
1 hyn? 2 2 2 4 1 2/1 2 
SEs? +) + Amu — my] + Gs — mys —3m)}. (81.17) 


+ The kinematic relations for elastic collisions in various frames of reference are given in Fields, 
§13. 
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In the centre-of-mass system, the values of the invariants s, t, u differ from 
(81.8) by the interchange of s and u: 


s = —4p’cos’ 56, t = —4p’ sin’ 40, u=4e’, (81.18) 


In the non-relativistic limit, formula (81.17) reduces to Rutherford’s formula: 


d = (4) do. “nentelativistio) 81.19 
o> ae sin’20 on-relativistic), (81.19) 


where v=2p/m. This comes from the first term in the braces in (81.17), which 
originates from the “‘scattering”’-type diagram (see §73). The contributions from the 
‘“‘annihilation”’ diagram (the second term in (81.17)) and from its interference with 
the scattering diagram (the third term) vanish in the non-relativistic limit. 

In the general case of arbitrary velocities, the contributions of all three terms in 
(81.17) are of the same order of magnitude; the first term predominates only at 
small angles, because of the factor t ” « sin 430. Combining like terms, we can write 
the cross-section for scattering of a positron by an electron (in the centre-of-mass 
system) in the form 


> 2) 2 2\2 4 4 
= do tem (eit py 1 Beta m* 1 
do = do ea | p’sin’J0 pre’ sin’]O” 
4 4 oo grees 2 4 
, We ie ape"+p) sin? 49 +B. sin‘9|. (81.20) 


The symmetry with respect to 6 and w-—9@ which is typical of scattering 
involving identical particles does not, of course, occur when a positron is scattered 
by an electron. In the ultra-relativistic limit, the expression (81.20) differs from the 
electron-electron cross-section only by the factor cos* 46: 


do. =cos’30do,. (ultra-relativistic). (81.21) 


In the laboratory system, where one of the particles (say the electron) is at rest 
before the collision, we again define 


A=-t t= (81.22) 


i.e. the energy transferred by the positron to the electron. As in (81.13), we now 
have 


s = —2m(e,—m—mA), 


t= —2m’A, u=2m(m + €,). 
Substitution of these expressions in (81.17) easily gives the following formula for 


+ See (83.4) and (83.20) for the passage to the non-relativistic limit in the scattering and annihilation 
terms in the scattering amplitude. The latter term contains a factor 1/c’, and therefore tends to zero. 
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the energy distribution of the secondary electrons: 


| eS a ee 


, dA Fe 2y°+4y +11 3y°+6y+4 2y 1 a 
—1\|A’ yt+1 A (y +1) (y +1) (y +1) ‘ 


(81.23) 


where y = €./m; A varies from 0 to y — 1. When A < y — 1, (81.23) leads to the same 
formula (81.15) as for electron scattering. 

The polarization effects in the scattering of electrons or positrons are calculated 
by the general rules given in §65. In all but special cases, the resulting formulae are 
lengthy. Here we shall give only some comments.t 

In the approximation considered (the first non-vanishing approximation of 
perturbation theory), the cross-section contains no terms linear in the polarization 
vectors of the initial or final particles. As in the non-relativistic theory (QM, $140), 
such terms are forbidden in consequence of the requirement for the scattering 
matrix to be Hermitian. The scattering cross-section is therefore unchanged if only 
one of the colliding particles is polarized; and unpolarized particles do not become 
polarized as a result of scattering. 

The same conditions prohibit correlation terms in the cross-section which 
contain the products of the polarizations of three of the particles (initial and final) 
concerned in the process. The cross-section does, however, contain double and 
quadruple correlation terms. In the scattering of unlike particles (electron and 
positron, electron and muon), these terms vanish in the non-relativistic limit, since 
there is no spin-orbit interaction. In collisions of like particles, however, there are 
correlation terms even in the non-relativistic case, because of exchange effects. 


PROBLEMS 


PROBLEM 1. Determine the scattering cross-section for polarized electrons in the non-relativistic 
case. 


SOLUTION. In the non-relativistic case, the bispinor amplitudes in the standard representation have 
two components, and the density matrices are the two-rowed matrices (29.20). In the scattering 


amplitude (81.3), the only non-zero terms are those with » = v =0, which contain matrices y° that are 
diagonal (in the standard representation). Instead of (81.4) we have 


p> [Myil’ = 16q°e" - am'| (2+) trd+o:)trdt+o: t)-= til +o-j(lt+o: “| 


1 1 1 
=16n'e*-4 ‘al + —-— 
67°e' -4m a alr (1+01-&)], 


the summation being over the polarizations of the final electrons. Hence the scattering cross-section is 


sin’ @ 
do = doo( 1 Tue eoeee Gi: tr) 


where @ is the scattering angle in the centre-of-mass system and do» the scattering cross-section (81.9) 
for unpolarized particles. For completely polarized electrons, this formula is the same as the result in 


+ For further details see the paper by W. H. McMaster, Reviews of Modern Physics 33, 8, 1961. 
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QM, §137, Problem, with |€,| = |@| = 1, 1-2 = cosa, where a is the angle between the directions of 
polarization of the electrons. 

For the scattering of positrons by electrons, there is no dependence on the polarization in this 
approximation (do = dao); this is easily seen by noticing that, in the non-relativistic limit, different pairs 
of components are non-zero in the electron and positron amplitudes up, and u-p. 


PROBLEM 2. In the non-relativistic case, determine the polarization of scattered electrons in the 
scattering of an unpolarized beam by a polarized target. 


SOLUTION. We can calculate the scattering cross-section for given initiai polarization (: and 
detected final polarization ¢1; only the polarization of one final electron is detected. By the same method 
as in Problem 1, we find 


do = ido | — tp 2808 8 = cos my] 


1+3 cos é 
The polarization vector of the scattered electron is therefore 


(= 4 2 cos 0(1 — cos 6) 
 14+3c08 6 


PROBLEM 3. In the non-relativistic case, determine the probability of spin reversal of a completely 
polarized electron scattered by an unpolarized electron. 


SOLUTION. We similarly find the cross-section for given polarizations ¢: and €): 


do =2dao [ +01 °C) 2 cos @(1 + cos |. 


1+3 cos’ 6 


Putting €:- Ci = —1, we then find the probability of reversal of the spin direction: 


do __(1~cos6) 
doy 2(1+3cos’ 6) 


PROBLEM 4. Determine the ratio of the scattering cross-sections for helical electrons with parallel 
and antiparallel spins, in the ultra-relativistic case. 


SOLUTION. In (81.4) we must put, according to (29.22), 


pi =(ypi)(1 — Ary’), = 3(yp2)(1 — 2d2y’), 


py 
(oh ee = 3p, p2= 2D 3, 


where A1, A2= +, The traces are calculated by the formulae given in §22; in particular, 


tr (y°(ya)y"(yb)y’) te Cy (yc) yulyd) yw) = P(e” abr (Counc “d") 
= 2(825% — 8°53)a,b,c7d" 
= 2(ac)(bd) — 2(ad)(bc). 


The result is 


do stu’ s?4+t? 2s Cai. ig ay” 2s) 
op + |) + Aa (“yp — +$ 4 
dt a t u =) (5 t ue tu 


Since the momenta of the colliding electrons (in the centre-of-mass system) are opposite, antiparallel 
spins correspond to like helicities (A1 = Az), and parallel spins to unlike helicities (A; = — A2). Substituting s, ¢, 
u from (81.8) (with p’ ~ &’), we find the required ratio: 


do tt /do N) =3(1 +6 cos’ 6 + cos’ 6). (1) 


This has its least value, ¢, when 6 =m. 
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PROBLEM 5. The same as Problem 4, but for the scattering of positrons by electrons. 
SOLUTION. In this case we have to calculate, instead of (81.4), 
|My’ > lon°e"| tr(p’— y"p— y") tr (pe yup}y) 4 tr (p:y"p-y"psyuptw) +.. ie 
the remaining terms are obtained by interchanging p; and p/. The density matrices are 


p-=x(yp-)1-2a-y°),  p 
p.=2yp!, p 


§81 


where Aj, A- = +3 (and for the positron, as for the electron, A+ = ) denotes that the spin is parallel to the 


momentum). The result of the calculation is 


at ers ee Sar Sea ere a ec me a 


d Sa Se 0S? Pr SP es 
Fx ( )—4rea-( ) 
t u tu t u tu 


Hence we find for the ratio of cross-sections the same value as formula (1), Problem 4. 


PROBLEM 6. Determine the cross-section for scattering of muons by electrons. 
SOLUTION. The process is described by the one diagram (73.17). Instead of (81.5) we have 


‘dt 
a = Tre t, 
o = Ope me 


Are‘ dt 


“[s—On + pyle Gn py So. 


f(t, u) = a tr [(ypit+ wy (ypu + By’) tr [(ypet+ m) ya (ype + m)y,]; 


(1) 


Pe, Py and pe, p, are the initial and final 4-momenta of the electron and the muon, and m, p their masses. 


The invariants are 
$= (De+ Pu) =m +p t+ 2pDu; 
t = (De — pe) = 2(m* — pepd) 
= 2(u*— Pups), 
=(pe- py =m + pw — 2pPi, 
sttt+u=2(m’*+p’). 


The result of the calculation is 
f = 2 {(peps)? + (Dep jy + (mn? + nyt} 
= Bas? + 02) + (m?+ 221 ~ m?~ 12) 
Formulae (1) and (2) give the solution of the problem. In the centre-of-mass system, 


4 

e° do —»2 > > > Rete 2. soa 
= - i , a 1 
da =3 ée + €,) p sin’ 30 [(ece, +p) + (eee, +p cos 0) —2(m* + w*)p’ sin’ 26] 


(2) 


where do = 27 sin 6 d@; &, €, are the energies of the electron and the muon; p = e2—-m’=e,~—p’. If 
p<’, we return to formula (80.9) for scattering by a fixed centre of Coulomb force. In the 


ultra-relativistic case (p’> pw”), 


pies e* 1+cos* 36 
8p" sin’ 30 
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In the laboratory system (where the electron is at rest before the collision), 


e°\? dA 2 A , mM’ > 
do = 2n(<) Se (1— vig + 52d"), 


Vu Eu 


where e, is the energy of the incident muon, and v, = p,/e€, is its velocity; mA = e. — m = e, — €, is the 
energy of the recoil electron; and 


ne) | 
m+ wo +2me, 


is the maximum value of A. 


PROBLEM 7. Determine the ratio of cross-sections for the mutual scattering of helical electrons and 
muons with parallel and with antiparallel spins, in the ultra-relativistic case (€, > uw, & >m). 


SOLUTION.?| As in Problem 4, we find 
do tt/do ) = cos* 36. 


where 86 is the scattering angle in the centre-of-mass system. 


PROBLEM 8. Determine the cross-section for the conversion of an electron pair into a muon pair (V. 
B. Berestetskii and I. Ya. Pomeranchuk, 1955). 


SOLUTION. This is another cross-channel of the reaction to which pe scattering belongs. In this 
channel, 


S=(pe- pay, t=(pet+ De)’, U = (Pe - Du); 


where Pe, Pe are the 4-momenta of the electron and the positron, and p,, p, those of the muon and the 
antimuon. The reaction threshold corresponds to an energy 2u of the electron pair (in the 
centre-of-mass system), so that we must have t > 4”. In the laboratory system, where the electron is at 
rest before the collision and the positron has energy «:, 


t =2m(e1.+m)=~2me., 


so that we must have e,>e, where the threshold energy «=2yu’/m; here and below, all ap- 
proximations allowed by the inequality ~ >m are made. 
The differential cross-section is (instead of formulae (1) and (2), Problem 6) 


a Ane’ ds f(t, u) 
~ (t—4m’)t a 


~ 4ne* [2(s? +u’)+ 2u°t — pe’). 
For given t, the quantity s takes values between the limits determined by the equations su ~ p‘, 


s+t+u~2p’, ie. 
py? —at —2V[t(t —4p)] <5 <p? —3t +2V[t(t — 4p)]. 


An elementary integration gives 
4 2 2 5 2 
oan (1-H) (14), n= elm; (1) 


in the laboratory system, t = 2me+. This formula is not valid in the immediate neighbourhood of the 
threshold: when ¢+—« ~ pe’, the muons formed cannot be regarded as free particles; when the 
Coulomb interaction between them is taken into account, the cross-section tends not to zero but to a 
constant value as 61> & (see QM, $147). 

The cross-section (1) has its maximum value when «+ = 1.7e. Its maximum value is about 20 times 
less than the cross-section for two-photon annihilation at the same energy. 


+ Another method of solving this problem is given at the end of §144. 
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$82. Ionization losses of fast particles 


Let us consider a collision between a fast relativistic particle and an atom, 
accompanied by excitation or ionization of the atom. Such inelastic collisions for the 
non-relativistic case have been discussed in QM, §§148-150; here we shall derive 
the relativistic generalization of the formulae obtained in QM (H. A. Bethe, 1933). 

The velocity of the particle incident on the atom is assumed large in comparison 
with those of the atomic electrons; thus we always assume that Za <1, 1.e. that the 
atomic number ts fairly small. This condition ensures the applicability of the Born 
approximation to the process under consideration. The solution of the problem 
depends to some extent on whether the fast particle is light (electron or positron) or 
heavy (meson, proton, a-particle, etc.). The second case is the simpler, and will be 
taken here. 

Let p =(e, p) and p’ = (e’, p’) be the initial and final momenta of the fast particle 
in the Jaboratory system, where the atom is at rest before the collision; the 
difference gq = p'—p gives the energy and momentum transferred to the atom by 
the particle. The range of possible momentum transfers can be divided into two 
parts: 


(I) q?/m <m, (II) q?/m > I, (82.1) 


where m is the electron mass and I is a mean energy of the atom, its ionization 
potential. The two parts overlap with I <q’/m <™m; this allows an exact joining of 
the results for each part separately. The momentum transfer will be said to be 
respectively small and large in the two parts of the range. 


SMALL MOMENTUM TRANSFER 


In this range, the atomic electrons may be regarded as non-relativistic in both 
the initial and the final state of the atom. 
The amplitude of the process is 


M$? = eS ko(—q)J b-p(q)D,..(q), (82.2) 


where J,o 1s the transition 4-current of the atom from the initial state (0) to the final 
state (n), and J,, is the transition 4-current of the fast particle; these currents here 
repjace Uu'yu, which would occur, for example, in the scattering amplitude of two 
“elementary” particles such as the electron and muon in (73.17); cf. also (139.3). 
The transition currents are taken in the momentum representation (see (43.11)). The 
cross-section of the process in the laboratory system 1s 


d’p 


Sey 
is ea ee (2 ceo a Se a 
do, = 2778(e — &'— wy) |M §; | 2 pl: On) 


(82.3) 


where wno = E, — Eo is the transition frequency between the states of the atom. The 
finai state may belong to either the discrete or the continuous spectrum, cor- 
responding to excitation and ionization of the atom respectively. In the law of 
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conservation of energy (represented by the delta function in (82.3)), the recoil 
energy of the atom is neglected; this is certainly permissible for a small momentum 
transfer. 

The photon propagator is here conveniently taken in the gauge (76.14), in which 
only its space components are non-zero: 


anes , - 14) 
Di (a) = ~~ (5x ~ 2). (82.4) 


Then only the space components of the transition currents in (82.2) are likewise 
needed. 

The atomic transition current J,o(q) is here the Fourier component of the usual 
non-relativistic expression: 


Snolq) = see | eo Y— YAVID) a, (82.5) 


where wy and wy, are the atomic wave functions; for simplicity, the sign of 
summation over the electrons in the atom will be omitted henceforward, 1.e. the 
formulae will be written as if there were only one electron in the atom. Integrating 
by parts in the first term, we can rewrite this expression as a matrix element: 


Jno(q) = 3(ve 14°" + e'4"y) no, (82.6) 


where v= —(i/m)V is the electron velocity operator. 

Since the momentum lost by the scattered particle is relatively small (\q| <|p)), 
the transition current for this particle can be replaced simply by the diagonal 
element 


J,p(0) = 2pz, (82.7) 


corresponding to classical motion in a straight line (cf. (99.5)); a factor z is included 
to take account of a possible difference between the particle charge ze and the 
electron charge e. 

Since q is small, so is the angle of deviation 3 of the particle. The longitudinal 
and transverse components of q (relative to p) are 


—qy =~ (dp/de)wny = @nolv, — qi ~ |p| 9, (82.8) 


and hence q: p= — E@no0. 
Substitution of (82.4)-(82.8) in (82.2) gives 


2: 

E —Igqes —ige —y} “Tr —y “Tr 

MyP= —- 5 (n 5; (q:ve T"+e '"q:v)+(p-ve “'+e "%"p-y) 
nO 


0) 


In the first term, since 


q: tf +fq-%=2if, 
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where f =e ‘"'" (see QM, §149), the matrix element of this operator is 2i(f)no = 
2 wnrofno. In the second term, e '“’* can be taken as unity, since q is small. Then 


Sarze’ 


MS? aameass {e(e Uno -_ ip rn0Wnot- 


The squared modulus is 


ik 647 Ze 5 ae 
IM PP = ase {e*|(e 4 nol + 2(q + Pno)(P * no) E@no + (p> P20) @noys (82.9) 
here, in the second term, we have put e'"''~1—iq-r, but this cannot be done in 
the first term, for a reason explained in the next footnote but two. 
The energy lost by the fast particle in its inelastic collisions with atoms? is given 
by 


K= 2 [ on do, = : 2 i wno|M $?/? do’, (82.10) 


where the summation is over all possible final states of the atom, and the 
integration is over the directions of the scattered particle; this quantity will be 
called the effective retardation (k/e is known as the energy loss cross-section). 

The integration in (82.10) can be carried out in two stages, by averaging over the 
azimuth of the direction of p’ relative to p and then integrating over do’ ~ 2708 d#, 
where @ is the small scattering angle. The first stage makes the change 


Q°lno7 | Xno ea (Wnol V)Xno, 
where x,o0 is the matrix element of one of the Cartesian coordinates of the atomic 
electrons.t The integration over dd can be replaced by integration with respect to 


q’, since 


= Ont pe? (82.11) 


and therefore 23 d9 = d|q’|/p’, M being the mass of the fast particle. The result is 


. 2 2 
c= Am(ze? S | {eno 28 wo leo (Me +2) EL 2.12) 


v*}J \q 
The lower limit of the integration with respect to q’ is 
lq7|min = (M7/p*) oho. (82.13) 


+ These are often called ionization losses, although they are due to excitation as well as ionization of 
atoms. 

¢ It does not matter which coordinate: after the summation over the directions of the angular 
momentum of the atom in the final state, which is implied below, the matrix element does not depend on 
the direction of the x-axis. 
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As the upper limit, we take a value |q’|, such that 
I <|q’*|,/m <m, (82.14) 


which thus lies in the overlap of ranges I and II (82.1). 

The integration and summation in (82.12) are carried out in the same way as was 
done for the non-relativistic case in QM, §149. The entire range of integration 1s 
divided into two parts: (a) from |q7|min to |q7lo and (b) from |q7\y to |q7|1, where |q7Jo is 
such that 


IMI |p| < V|q?|o< ma; (82.15) 


the quantity ma on the right is of the order of the momenta of the atomic electrons. In 
part (a) we can put e ‘"= 1—iq-r, and the contribution of this part to « is 


lq2lo 
1 1 M? 1 
An(ze’y i {s 4ol\Xaol” poy — — Waal Xnol” ld a‘ 
TT ( Oe 52 Ono o| a p o| Tae |q’| 
q |min 


Aar(ze’y’ “1 p- 
Ss ee > @nolXnol” log VEE _ 0°]. 


In the second term, the integration can be extended to infinity. 
The summation is carried out by means of the formula 


S) wnolXnol” = Z/2m, (82.16) 


where Z is the number of electrons in the atom; see QM, (149.10). The result can 
be written 


9) ze? 27, 212 
neler ys log id - 0°, (82.17) 


where I is an average energy of the atom, defined by 


S WnolXnol” log Wno 


log I =- 
> Wno|Xnol 


= us wnolXnol- log Wno0: (82. 18) 


In part (b), (82.11) shows that |q’| ~ p’9”, i.e. |q’| is independent of the particular 
final state n of the atom, and the limits of integration are also independent of n. 
The summation over n can therefore be taken inside the integral in (82.12). In the 


QE4-W 
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first term, the summation is carried out by means of the formula 
Dd (Ce! nol? Ono = (Z/2m a? (82.19) 


(see QM, (149.5)), and the integral ist 


2mZ(ze’)’ la’|i 
or eeemr ~ LOB eepes 
mv lq lo 
The integral of the second term in (82.12) over this part of the range makes a 
negligible contribution to k. 
Adding the last formula to (82.17), we find as the contribution to x from the 
whole range of small momentum transfers 


9) Z ze" 2 2 2 
oe tog AP 0. (82.20) 


LARGE MOMENTUM TRANSFER 


Let us now consider collisions with a momentum transfer which is large 
compared with the momentum of the atomic electrons (q’> mI). Here we can 
evidently neglect the binding of the electrons in the atom, regarding them as free. 
Accordingly, the collision between the fast particle and the atom may be taken as 
an elastic collision with each of the Z atomic electrons. Because of the high speed 
of the particle, the atomic electrons may be assumed to be originally at rest. 

Let mA denote the energy transferred from the fast particle to an atomic 
electron, and let do, be the cross-section for elastic scattering with this energy 
transfer. The differential effective retardation by the whole atom is then 


dk = ZmAdoy. (82.21) 


The maximum energy which can be transferred to an electron at rest by the impact 
of a particle with mass M > m is 


a 2mp ___2mp* 
max m?+M?*+2me M’*+2me’ 

where « and p are the energy and momentum of the incident particle; see Fields, 

(13.13). We shall also suppose that the energy «, though ultra-relativistic (« > M), is 

nevertheless such that 


& << M2/m. (82.22) 


+ The logarithmic divergence of the integral at the upper limit is the reason why e ‘*’' cannot be 
expanded in powers of q in the first term in (82.12). 
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Then even the maximum energy transfer 
mAmax~ 2mp*/M?=2mv?y?_ (y = e/M = 1/V(1- v?)) (82.23) 


is small in comparison with the initial kinetic energy of the incident particle 
(m Amax < & — M). Correspondingly, the momentum transfer q is always small in 
comparison with the initial momentum p of the particle. This enables us to regard 
the motion of the particle as being unaltered by the collision, i.e. the particle itself 
as having infinite mass. Then the scattering cross-section is found by simply 
transforming the cross-section (80.7) for electron scattering by a fixed centre to the 
laboratory system, in which the electron 1s initially at rest. This is easily done by 
noting that, in the approximation used, 


—~qg~q=4p’sin?38, do’ = wd|q’I/p’, 
and the relative velocity is v in both systems. Formula (80.7) becomes 


do = ne" (1 - |a"| ) alg) 


Am’y’ Iq?" 


The energy transfer A is expressed in terms of the same invariant q*: — q? =2m’A, 
and thereforet 


21 (ze) A \dA 
a= a (1 2p <~) ae (82.24) 


The contribution to the effective retardation from this range of momentum 
transfers is found by integrating (82.21) from the limit |q*|, defined above to 
lq7|max = 2M’Amax. The result is 
alee Z | 2Amax” _ i 
we) log ali vt. (82.25) 

Finally, adding the contributions (82.20) and (82.25), we have for the total 
ionization losses by the fast heavy particle (in ordinary units) 


_ 4nZ(ze’*)y ( 2mv> 2) 
ia mv. log Td = v?/¢2) C2 : (82.26) 
In the non-relativistic case, this reduces to QM, (150.10): 
_ 4nZ(ze’y, _ 2mv* 
a log Tt (82.27) 


+ In this expression, of course, no account is taken of the specific effects of strong interactions when 
the heavy particle is a hadron. Such effects (corresponding to the hadron form factor) are, however, 
important only when |q’| « 1/M’*, and such momentum transfers are excluded by the condition (82.22). 
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and in the ultra-relativistic case 


= 422) | 2mc* | ) 
ie mc? log ia = v?/c?) 1 . (82.28) 


The retardation depends only on the velocity of the fast particle, and not on its 
mass. The decrease of the retardation with increasing velocity (82.27) changes to a 
slow (logarithmic) increase in the ultra-relativistic range. 


PROBLEMS 


PROBLEM 1. Determine the effective retardation of a relativistic electron. 


SOLUTION. The contribution of the range of small momentum transfers is again given by (82.20). 
For large momentum transfers, (82.24) must be replaced by (81.14), which includes exchange effects. 
Integrating A- doy over dA from |q’|:/2m’ to ay — 1) and adding to (82.20), we get 


_ 2nZe* m(y’-1(y-De* (2. 1 1 (y-1 
« = AES [log he (- 2) log 2+ s+ Sy i (1) 


where 
== 0 fey" 


In the non-relativistic case, we get the formula given in QM, §149, Problem, and in the ultra-relativistic 
case (y > 1) 


(2) 


PROBLEM 2. The same as Problem 1, but for a positron. 


SOLUTION. For doy in the range of large momentum transfers, the expression (81.23) must be used, 
the upper limit for A being y — 1. In the ultra-relativistic case, the result is 


= 2228" (ig 2m°c*y? _ 23) 
he ee 12/) 


§83. Breit’s equation 


In classical electrodynamics, a system of interacting particles can be described 
by means of a Lagrangian function depending only on the coordinates and 
velocities of the particles, and correct as far as terms ~1/c’ (Fields, §65). This is 
because radiation appears only as an effect of order 1/c’. 

In the quantum theory, this corresponds to the possibility of describing the 
system by Schrédinger’s equation including second-order terms. For an electron 
moving in an external electromagnetic field such an equation has been derived in 
§33. We shall now derive a similar equation describing a system of interacting 
particles. 

We start from the relativistic expression for the scattering amplitude for two 
particles. In the non-relativistic approximation, this becomes the usual Born 
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amplitude, proportional to the Fourier component of the potential of electrostatic 
interaction of two charges. By calculating the amplitude as far as second-order 
terms, we can establish the form of the corresponding potential, taking account of 
terms ~1/c’°. 

Let us first assume that the two particles are different, with masses m, and m, (say 
an electron and muon). Then the scattering process is represented by a single diagram, 


The corresponding amplitude is 


Mg = e*(itiy"u)D,,.,(q )(iaisy"Us), (83.1) 


qd =Pi-— Pi =P2- D>; 


here it is assumed that the charges have the same sign. If the signs are different, e7 
becomes —e’. 

The subsequent calculations are considerably simplified if the photon pro- 
pagator D,, is chosen not in the ordinary gauge but in the Coulomb gauge (76.12), 
(76.13):7 


22, aut = 7 aq  — 2) 
Do = q?” Dj =9, =D Toe 10 (5 a) (83.2) 


Then the scattering amplitude is 
My = e*'{(iiiy°us)(Gry"u2) Doo + (Hi y'us)(Hy“u2) Dix}. (83.3) 


If all terms in I/c are neglected, the second term in the braces vanishes, and the 
first term gives 


My = — 2m, - 2m (w{?*w)wS)*w)U (q), (83.4) 

where 
U(q) = 47e7/q’, (83.5) 
and w{”, w$”,... denote the spinor (two-component) amplitudes of the non-rela- 


tivistic plane waves, as defined in $23. The function U(q) is the Fourier component 
of the Coulomb interaction potential energy, U(r) = e?/r. 

In the next approximation (with respect to 1/c), the “‘Schrédinger’’? wave 
function of the free particle ¢@s., (normalized by the integral f lPscn|? d’x) satisfies 


+ In this section, factors of c will be written in all formulae, and factors of h in the final formulae. 
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the equation 


H dsch = (€ — mc’) dscns 


a! (83.6) 


HOo=P _ p = -iV, 


2m 8m?c”’ 


which includes the next term in the expansion of the relativistic expression for the 
kinetic energy. The (spinor) amplitude of this plane wave will be denoted by w, 
which tends to w® as 1/c >0. The required scattering amplitude must be expressed 
in terms of these amplitudes, in order to determine from its form the “Schrodinger’”’ 
interaction potential of the particles in the approximation considered. 

In accordance with formula (33.11), the bispinor amplitude u of the free particle 
can be expressed in terms of the “Schrodinger? amplitude w, with sufficient 
accuracy, by 


7 (1- yosekag 
u= Vm) ( ee) (83.7) 
This formula gives 
— 7,0 — 1,1 
Uiy Uy = Uy uy 
aan 2 i 
= 2m, (1 = Pe] wi*w, + amc? wi*(o + pi(o- pi)Wi 


Uiyu; = u;*au, 
= (1/c)w)*{o(o: pi) +(o° pio}w; 
= (1/c)wi*{io X q+ 2p, + q}w4, 


where g = pi— pi = p2— p>. The corresponding expressions for (H5y°u2) and (yu) 
differ in that the suffix 1 is replaced by 2 and q by —q. 

We now substitute these expressions in (83.3). Since the product (@)yui)(Hiyu2) 
already contains the factor 1/c’, the term w’/c’ in the denominator of Dy may be 
neglected. The scattering amplitude is then 


Mj; = —2m, - 2m2(w{*w5* U (py, po, q)W1 Wo), (83.8) 
where 
U (pi, p2, q) = hit eee ee 
Ge oo ~ oi) (83.9) 
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the suffixes 1 and 2 to the Pauli matrices indicate the spinor indices on which they 
act, o, acting on w,; and o> on W>. 

The function U(p;, po, q) is the particle interaction operator in the momentum 


representation. It is then related to the operator U (pi, po, r) in the coordinate 
representation by 


| ei Pi “ry+p3° IU (pr, p>, r) eg iP ry) +p °F) d*x, d*x, 


= (27)d(pi + po — pi — p2)U (pr, po, q). (83.10) 
If the operator U is simply a function U(r)(r=r,—r), then U(p;,p2,q) is 


independent of p; and p., and formula (83.10) reduces to the usual definition of the 
Fourier component: 


[emuw d’x = U(q). 
Hence it is clear that, to find U (pi, p2,r), we must calculate the integral 


| e'1* U(py, po, q) d°q/(27)’, 


and then replace p, and p, by the operators p; = —iV, p2 = —iV2, writing these to the 
right of all the other factors. 
The required integrals are found by differentiation of the formula 


«@ 14m d’q 1 
iqer THE =e 
[e x Qn r (83.11) 


For example, taking the gradient gives 


3 . 
[ ei Sta 4, = iV = 5 (83.12) 
q T 
Next, with a and b constant vectors, we have 


4n(a:q\(b-q) iq €'qQ 1. ce | ave a\i1 dq. 
| q’ oe Say ti(a a) a (b ra) ee 


the resulting integral, after integration by parts, reduces to (83.12), so that 


4m(a-q)(b-q) ig: d’q 1 age Der 
| q' e (mys 28 V) 


r 


=5,[a-b- 2) (83.13) 
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Finally, 


ieee seme iq? hr 2 =-—(a- V)(b- Vs. 


ies, 


In expanding the derivatives, it must be remembered that these expressions include 
the delta function 6(r). To separate this, we note that, after averaging over the 
directions of r, 


1 
~ (a+ V(b V)— = —Ha-b)A += “2 (a- b)a(r). 
Now expanding the derivatives in the usual manner, we find 


oma q)(b - q) eid £4 Soha aac ~32-eO| 3 aq: bd(r); (83.14) 


on averaging over the directions of r, the first term vanishes and only the 
delta-function term remains. 

Using these formulae, we obtain the following final expression for the particle 
interaction operator: 


2 2 
ee oe Ae MO Ne AN gn 
U (pi, po, r) = ; 702 (<a +=) 5(r) 
2 ae 2 
_ e ae rte pus eh 
2mm c?r E P2 + r’ Aer Pt ay 
e-h 2 
eg Me ame er ee meen Ol? 
242 . 8 
cpitaf gh ste mesa} es 


The total Hamiltonian of the two-particle system in this approximation is 
H = A{?+ H+ U, (83.16) 


where H® is the free-particle Hamiltonian (83.6). 


Two ELECTRONS 


If the two particles are identical (two electrons), then the scattering amplitude 
includes a second term which is represented by the ““exchange”’ diagram 
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There is, however, no need to calculate the contribution of this term to the 
interaction operator. The reason is that the description of a system of identical 
particles by means of Schrédinger’s equation can be achieved with an interaction 
operator similar to that for non-identical particles, if the solutions of the equation 
are appropriately symmetrized. In particular, for particle scattering this sym- 
metrization will automatically take account of the contributions to the amplitude 
which correspond to the two Feynman diagrams. 

Thus the Hamiltonian of the two-electron system is obtained from formulae 
(83.15), (83.16) by simply putting m, = m2:T 


A De Geue. i et Aiciae 
H — U (pi, | r), 


ev (eh e* (2 (pipe 
U (pi, por) =" 0 ( ) d(r) — a ee = (Bi i 
eh 
+ tots t-(@1 + 202) +r X pi + (2 + 201) +r x prof + 
1/eh\’ . 3 , : g 
+5 (25) {ou o2— Aer TNO a TF bee 5(r)} (83.17) 


The presence of terms in 6(r) does not, of course, imply that there is a particularly 
strong interaction. The value of all the correction terms after integration is of the 
same order, and according to the sense of the expansion used they are all to be 
regarded as small compared with the first term (the Coulomb interaction). 

The different groups of terms in the interaction operator (83.17) are of different 
types. The first three terms have a purely orbital origin. The next term is linear in the 
spin operators of the particles, and corresponds to the spin-orbit interaction. The last 
term, which is quadratic in the spin operators, describes the spin-spin interaction.# 


ELECTRON AND POSITRON 


The electron—positron system needs special consideration. The scattering am- 
plitude in this case consists of two terms: 


My: = —e*[a(p)y“u(p -)1D,.,(p- — p!)[i(—p+)y"u(—p3)] + 
+ e*[i(—p,)y"u(p_)]D,,(p- + p+) [a(p Dy’u(-p4)]; (83.18) 


the first term corresponds to the scattering diagram and the second to the anni- 
hilation diagram. Since the wave function of the “‘electron + positron” system need 


+ The wave equation with the Hamiltonian (83.17) was first derived by G. Breit (1929); a consistent 
quantum-mechanical derivation was given by L. D. Landau (1932). 

£ This interaction has been mentioned in QM, §72, in connection with the fine structure of the 
atomic levels, and the spin-spin interaction between the electrons and the nucleus is considered in QM, 
§121, in connection with the hyperfine structure of levels. In particular, the formula QM (121.9) 
corresponds to the delta-function term in the spin-spin interaction operator. 
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not be antisymmetric, the two terms make independent contributions to the 
interaction operator. 

The first term (which has the same structure as the amplitude (83.1)) leads, of 
course, to an operator differing only in sign from (83.17). Let us now consider the 
transformation of the second term. 

Here we use the photon propagator in the ordinary gauge: 


An _ An 
Pw = RE Bu = GA ct gt Bow 


In the present case k = p,+p-, and since the particles are ‘almost non-relativis- 
tic’’, we have 


w/c? =(e,+ ele’? ~4m’c’>(p.t+p_yY =k’. (83.19) 
For the photon propagator it is therefore sufficient to write 
Dy. = (alm’c’) gu, 


This already contains a factor 1/c’. It is therefore sufficient to take the amplitudes 
u(p) in the zero-order approximation: 


u(p-) = V(2m) ) u(—ps) = V(2m) (0) 


where w™, w™ are the three-dimensional spinors which appear in (23.12); the index 
(0) will henceforward be omitted. With these amplitudes we have 

ii(—p.)y°u(p-) = u*(—p)u(p-) = 0, 
u(—p.)yu(p-) = u*(—p,)au(p_) = 2m(w*ow-). 


On substitution of these expressions, the “annihilation” term in the scattering 
amplitude becomes 


Mn” = —e? aoe (2m)(w*ow_)(w!*ow’). (83.20) 


It is not yet possible, however, to draw from this any immediate conclusions as to 
the form of the interaction operator. Firstly, the spinors w in terms of which the 
amplitudes u(—p,) are expressed are not yet literally positron spinors. The positron 
amplitudes are got from u(—p,) by charge conjugation, and according to (26.6) the 
corresponding spinors (which we denote by w,) are related to w by w,=o,w*, 
whence 


w* = oyWs = — W405, w =—oayw*. (83.21) 


Secondly, the scattering amplitude must be brought to a form in which the 
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electron spinors (w_ and w/) are contracted, and likewise the positron spinors (w, 
and w,). This is achieved by means of the formula 


(w*aw_)(w!*ow’) =3(wl*¥w_)(w*w’) —3(wl*aw_)(wtow’), (83.22) 


which follows from (28.17) 
Finally, expressing w and w’ in terms of w. and wi by (83.21), we easily find 


(w*w') = (wi*w,), (83.23) 


(w*ow’') = —(wi*ow,). 
Substituting (83.23) in (83.22) and then in (83.20), we obtain the final expression 


for the annihilation part of the scattering amplitude: 


qe’ 
amc? 


M$"? = — 4m? (wows 3+o0,° o.)| wwe), 

the matrices o_ and o, acting on w_ and w, respectively. The expression in the 
square brackets is the interaction operator in the momentum representation. The 
corresponding coordinate operator is 


“, (ann) = th’ e? = 
Ue™(r) = Spare! (3+0.°o0_)d(r), r=r_—-r, (83.24) 


(J. Pirenne, 1947; V. B. Berestetskii and L. D. Landau, 1949). The total electron— 
positron interaction operator is -U + U®™, with U given by (83.17). 


$84. Positronium 


The results obtained in §83 can be applied to positronium, a hydrogen-like 
system consisting of an electron and a positron. 

In the centre-of-mass system, the electron and positron momentum operators in 
positronium are p_- = —p; =p, where p = —ifV is the operator of the momentum of 
relative motion corresponding to the relative position vector r=r_—r,. The total 
Hamiltonian for positronium ist 


A 


p e? - 7 ” 
aes amar: Vit V24+ V3, 


a4 2 
5, 25(r)-—2——{p? + 0 PB Rt 
Vi= Amici t 47H08(r) atom 8 5 ’ is 
4.1 


V3; = 6p) = ae = ‘S| + 4irpaGS? — 2) 6(r). 


+ In ordinary units. 
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Here py = efi/2mc is the Bohr magneton, hl =r X p is the orbital angular momentum 
operator, S=(o,+0 _) the total spin operator of the system, whose square 
S? = (3+0,-o_). V; includes all the purely orbital correction terms, V2 the spin-— 
orbit interaction, and V; the spin-spin and “annihilation” interactions. 


The “unperturbed” Hamiltonian 
H =p'/m-e?/r 


naturally differs from the Hamiltonian of the hydrogen atom only in that the 
electron mass is replaced by the reduced mass 3m. The energy levels of positronium 
therefore have absolute values which are half those of hydrogen: 


E =—-me'/4h’n’, (84.2) 


where n is the principal quantum number. 

The remaining terms in (84.1) cause a splitting of the levels (84.2), 1.e. the 
appearance of a fine structure. The resulting levels are classified primarily by the 
values of the total angular momentum j. We also see that the particle spin operators 
appear in the Hamiltonian (84.1) only through the sum S. This means that the 
Hamiltonian commutes with the squared total spin operator S$’, i.e. the value of the 
total spin continues to be conserved in the approximation considered (the second 
approximation with respect to I/c). The energy levels of positronium can therefore 
be classified by the total spin, which takes values S = 0 and S = 1. The levels with 
spin 0 are called parapositronium levels, and those with spin | orthopositronium 
levels. 

It must be emphasized that the conservation of the total spin in positronium is 
actually exact, and does not depend on any particular approximation with respect 
to 1/c; it follows from the CP invariance of electromagnetic interactions. Posi- 
tronium is a strictly neutral system, and its states therefore have definite charge 
parity and combined parity. The latter is equal to (— 1)°*! (see §27, Problem); since S 
can take only two values, 0 and 1, the conservation of combined parity is equivalent to 
that of total spin. 

When S=0 the total angular momentum j is equal to the orbital angular 
momentum, but when S = I and j is given, the number | can take the values j, j + 1, 
so that in general each level (n, Jj) of orthopositronium is split into three. Since the 
values |! =Jj and !=j +1 correspond to opposite parities, the Hamiltonian has no 
matrix elements between these states. But the perturbation operator (the first term 
in V3) in general has non-diagonal elements between states with |!=j+1 and 
| =j—1; the number | then, of course, no longer has the strict significance of an 
orbital angular momentum. 

The Zeeman effect in positronium has some unusual features (V. B. Berestetskii 
and I. Ya. Pomerachuk, 1949). 

The orbital magnetic moment of positronium is always zero: since in posi- 
tronium r, X p; =r_ Xp_, we have the operator 


fu = wolrs X Ps —r. X p-) = 0. 
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The spin magnetic moment operator is 
fis = polo. — o_); (84.3) 


it is not proportional to the total spin operator S = 3(o0, + o_), and the operators S” 
and yu’ do not commute. The states with definite values of the total spin S and its 
component S, are therefore not, in general, eigenstates for the magnetic moment. 

States with given S kand S, are described by spin functions yss having the form 


Xu=Asa-,  X1-1 = B+ B-, 
1 
X10 = 75 (aiB-+ a_B.), (84.4) 


X00 = nz (a,B-—a-B+), 


where a and £ are the spin functions of one particle corresponding to spin 
projections +; and —}4; the suffixes + and — indicate that the function belongs to 
the positron and the electron respectively. The first two spin functions, x1 and 
X¥1-1, are also eigenfunctions of the operator w,, corresponding to the eigenvalue 
zero. The functions yio and yo are not eigenfunctions of p,, but the following 
combinations are eigenfunctions: 


5 (x10 + X00) = a+B-, 5 (x0 = x00) = a-B+. (84.5) 


It is easy to see that the only non-zero matrix elements (S’S/|u,|SS,) calculated 
from the functions (84.4) are 


(00| 1,10) = (10}2,|00) = 20. (84.6) 


In weak magnetic fields (when uoH «KA, where A is the difference between the 
level energies with S = 0 and S = 1) the initial approximation for the calculation of 
the Zeeman splitting is formed by states with definite values of the total spin. In the 
first approximation, this splitting is given by the mean value of the perturbation 
energy operator 


Vy = —(.H. 


But all the diagonal matrix elements of the operator ji,, and therefore Vu, as 
calculated from the functions (84.4), are zero. Thus, in weak fields, there is no 
linear Zeeman effect in positronium. 

In the opposite limiting case of strong fields (4H >A), we can neglect the spin 
interaction which brings about definite values of S. The components of the split 
level will then correspond to states with definite values of uw, = +2,» (described by 
the functions (84.5)), and the displacement of these components will be +2y0H. 
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PROBLEMS 


PROBLEM 1. Determine the fine structure of the levels of parapositronium (V. B. Berestetskii, 
1949). 


SOLUTION. The required level splitting energy is given by the mean values of the correction terms 
in the Hamiltonian (84.1), calculated by means of the wave functions of the unperturbed states with 
different values of j =1 (=0,1,...,n— 1). When S =0, the only non-zero contributions come from V\ 
and the second term in V3. 

The unperturbed wave functions, which we denote by ¥y, satisfy Schrédinger’s equationt 


py =-Ay=(E+-) uy, E =~1/4n?. 
Hence 
pv=6'(E+2) y= (E+2) v-vat+2(v4)- cy 
= (E+ Vy tansy +5 


The mean value is 


= (e+1) + amino + {[ dr do. 


The integral is equal to —f |(0)|? do; since (0) = 0 except when | =0, and the wave functions of S 
states are spherically symmetric, the integral i is —4a| p(O)f’ and cancels with the second term. 
Using the orbital angular momentum operator | = r X p, we can write 


The other required mean value is therefore 
lo 
[ vt 5--ppya'x = - | yrT4 oY ps 


== (E+2)-4mlyOP-GD*: 


if 1 =0, the last. term does not appear. 
According to the familiar formulae 1 in the theory of the hydrogen atom (QM (36.14), (36.16)), with the 
electron mass m replaced by 5m, we have 


a a ee | 
\¥(O)/’ = Bank Om r = 902 r = awl 4 1)’ 


—a | 
A EDO) ee 


From these formulae, we find the required energy levels of parapositronium: 


Bu =~ qh a? MEL (Ltt) 
ne An? W wW\Ql+1 = 32n/' 


+ The fine structure of orthopositronium has been discussed by A. A. Sokolov and V. N. Tsytovich, 
Zhurnal éksperimental’ noi i teoreticheskoi fiziki 24, 253, 1953. 
t In the calculation it is convenient to use atomic units. 


§85 The Interaction of Atoms at Large Distances 347 


PROBLEM 2. Determine the difference between the energies of the ground states (n = 1,1 =0) of 
orthopositronium and parapositronium. 


SOLUTION. The dependence of the energy on the total spin S when | = 0 arises only from the mean 
value of the second term in V3; the first term gives zero on averaging over angles in the spherically 
symmetric S state.t The ground level of orthopositronium (S;) lies above that of parapositronium (So) 
by an amount 


ECS) — E( 'S) = 4 0? BE = 8.2x 10 *eV. 


§85. The interaction of atoms at large distances 


Attractive forces act between two neutral atoms at a distance r apart which is 
large compared with the dimensions of the atoms themselves. The usual quantum- 
mechanical calculation of these forces (see QM, §89) is, however, inapplicable at 
very large distances, because this calculation considered only the electrostatic 
interaction, i.e. retardation effects are ignored. Such a treatment is valid only if the 
distance r is small in comparison with the characteristic wavelengths A» of the 
interacting atoms. In this section we shall give a calculation not subject to that 
limitation. 

The procedure is much the same as in §83: the amplitude of elastic scattering 
(i.e. scattering without change of internal state) for two different atoms is cal- 
culated in the first non-vanishing approximation. The resulting expression is com- 
pared with the amplitude which would result if the interaction between the atoms 
were described by the potential energy U(r). 

In the latter case, the first non-vanishing S-matrix element describing the 
process in question would be the first-approximation element 


Si =—l | Wi* (ry) Wr" (12) U (r) Wir) Yr) d°x, d°x2 X 


x | expl-iter+ ery rs (85.1) 


Here wW&, Ww and wi, Ws are the time-independent parts of the wave functions (plane 
waves), describing the translational motion of the two atoms with initial and final 
momenta; €;, €2 and e}, e+ are the kinetic energies of this motion; the coordinates r, 
and r, of the atoms as a whole can be regarded as the coordinates of their 
nuclei, and the distance r = |r; — r.|. The time integral in (85.1) gives, as usual, the delta 
function which expresses the law of conservation of energy. For convenience in the 
subsequent comparison, however, it is better to consider formally the limiting case of 
atoms of infinite mass; for given momenta, this limit corresponds to zero energies e. Or 
we can say that the times considered are small in comparison with the periods 1/e. 


+ The averaging over angles must precede the integration over r, as is evident from the manner of 
calculation of the integral (83.14) which leads to the first term in V3. 
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Then (85.1) becomes 


Sii as | | {J Wi de* U (nd d°x, d*x», (85.2) 


where t is the time integration range. 

The actual calculation of the elastic-scattering amplitude, under these assump- 
tions, can be divided into two stages. We first average the S-operator over the 
wave functions of the unchanged (ground) states of the two atoms (for given 
coordinates r; and r, of their nuclei) and over the photon vacuum: no photons are 
present at the beginning and end of the process. We then obtain a quantity which 1s 
a function of the distance between the nuclei, and which we denote by (S(r)).7 In 
order to find the required transition matrix element, we have then to calculate the 
integral 


8. =[f view Simbu Paid x (85.3) 


Comparison with (85.2) shows that, if (S(r)) is obtained in the form (S(r))= 
—itU(r), then the function U(r) is the required energy of interaction of the atoms. 

Since we are here concerned with a collision not of elementary particles but of 
more complicated systems, namely atoms, which may be excited in the inter- 
mediate states, the usual formal rules of the diagram technique are not directly 
applicable, and we shall begin from the expression of the S-operator as the 
expansion (72.10). 

In the interaction of atoms, the important field components are those whose 
frequencies are of the order of atomic frequencies or less. The corresponding 
wavelengths are large compared with atomic dimensions. The electromagnetic 
interaction operator can therefore be taken in the form 


V =—K(r,) : d, — E(r2) « db, (85.4) 


where dj, d2 are the dipole moment operators of the atoms (i.e. the time-dependent 
or Heisenberg operators) and E(r) is the electric field operator at the positions of 
the corresponding atoms. 

The mean values of the dipole moment of the atom in its stationary states are 
zero (QM, 875). Hence it follows that a non-zero amplitude occurs only in the 
fourth approximation of perturbation theory, i.e. as the matrix element of the 
operator 


go CF | dt,... i dt,. T{V(t1) V(t2) V(ts) V(ta)}: (85.5) 


in lower orders, every term in the product of operators V will contain at least one 
of the operators d, and d, in the first degree, and on averaging over the state of the 
corresponding atom the result is zero. 


+ In place of the more lengthy notation for a diagonal matrix element, indicating the states of the 
atom and of the photon field. 
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Let us now average the operator (85.5) over the photon vacuum. According to 
Wick’s theorem, the expectation value of the product of four field operators E is 
the sum of products of pairwise expectation values (contractions). The division into 
pairs can be made in three ways, which may be represented by the diagrams 


1 2 1 2 1 2 
ona Q p 9 9 
\ i | | 

> ye I | 

yy | | (85.6) 

x a ! | 

Oa See oft ‘S d b 
3 4 3 4 3 4 


where the broken lines represent contractions and the numbers correspond to the 
arguments ¢), ft, t3, tz. Moreover, spatial coordinates r; or r. may correspond to 
each point, with two points having r; and two ro, since otherwise, in the relevant 
term of the sum, one of the operators d; and d> will appear in the first degree, giving 
zero On averaging with respect to the state of the atom. It is clear that there must 
be one r; and one r, at the ends of each line, since otherwise the diagram (i.e. the 
corresponding term in the matrix element) will reduce to a product of independent 
functions of r; and of r2 instead of being a function of the difference r;—r.; such 
terms do not pertain to scattering.t In accordance with these conditions, the 
arguments r, and r, can be assigned to the four points in the diagram in four ways. 
Using also the commutativity of the operators d; and d) and averaging over the 
states of each atom, we find that all the 3 x 4= 12 terms thus obtained are equal, 
differing only in the naming of the variables of integration. The result is 


(SCA) =4f dtr... f dts. (TEC, HE (rs, t2) x 
x (T(E\(ro, t3)Em (1, ta) XT (di (th) dim (ta)))(T (dex (tr) dai(ts))), (85.7) 


where i, k, |, m are three-dimensional vector indices. 
To calculate the quantities 


Dik(X1 — Xo) = (T(Ei(*1) Ex(%2))) (85.8) 


we use the gauge in which the scalar potential ® = 0. Then Ek = — dA/dt, and we 
have 


Dix — Xo) = ss (TA (41) Ak (X2))) 


an Oty 


o? 
l or Dix (X), 


where x = X;— xX, and D(x) is the photon propagator in this gauge.# 


+ They give corrections, of no interest here, to the energy eigenvalues of each atom. 

+ The first derivative dDx(t)/dt has a finite discontinuity at t=0. The second derivative, i.e. the 
function D®,(t), therefore includes a delta-function term ~5“ (x2— x). This term, however, is zero for 
all r; #r2 and is of no interest here. 


QE4 - X 
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We shall find it more convenient to use the propagator D,(w,r) in the mixed 
w—r representation, related to D,(t, r) by 


Dx (t, r) = | Dx(w, r) ee dw/27, 
with 
Dk(t, r)=—i | w’ Dia, r) ge dw/27. (85.9) 


The quantities 
Oi (ty — to) = UCT (dit) di (t2))) (85.10) 


can be expressed as a Fourier integral 


fo.) 


a(t) = | e a.(w) dw/27. 


—0b 


Putting for convenience t,= 0, t; = t, and using the definition of the T product, we 
can write 


ie.) 


Qi (w) = | e'" w(t) dt 


—& 


0 00 
=] | ew (d,(0)d;(t)) dt+i | e'*(d;(t)d,(0)) dt. (85.11) 
0 0 


The mean values (with respect to the ground state of the atom) which appear here 
can be expressed in terms of the matrix elements of the dipole moment: 


(d,(0)di(t)) = > (dion (dino e'", 
(di(t)d,(0)) = >) (dion (dk)no e°"™. 


For convergence of the integrals in (85.11) it is necessary to take w in the first 
integral as wm — i0, and in the second as w +i0. Carrying out the integrations, we 
obtain 


on(a)=> (Stead 1 (dion (di)no ) (85.12) 
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If the ground state is an S state, this tensor is simply a scalar, a,(@w) = adx, 
where 


a(w)=5 > \don|? (—~— +) (85.13) 


o0-w—-i0 arytw—i0 


If, however, the atom has an angular momentum, the same result is obtained on 
averaging over the directions of this angular momentum, and it will be assumed that 
this has been done; we are, of course, interested in the interaction of atoms 
averaged over their mutual orientations. 

Comparison of (85.12) with (59.17) shows that ay.(w) is the same as the tensor 
for coherent scattering of a photon of frequency w by an atom. According to 
(59.23), a(w) for w >0 is the polarizability of the atom. Its values for w <0 are 
expressed in terms of those for w >0 by means of the relation a(—w) = a(w), 
which is obvious from (85.13). 

Substitution of these expressions in (85.7) gives 


dQ), dM) dw, dw» 


2a 2a 2n QT 


(S()) =3 | dtr... dt, 


x a(O;)ar(O2)0{Dy(o1, r)w3Dx (a, r) x 
x exp{—iw,(t; — tr) — im2(t3 — t4) — 104(t; — ta) — 1On(t2 — ts)}, 


where r=r,—f, and we have used the fact that Dj,(w,r) is an even function of r. 
The integration over three times gives delta functions (whereby —Q,; = 22 = a2 = 
w,), and that over the fourth time gives a factor ft: 


(S(r)) = —itU(r), 


where 


[e.¢) 


U(r) = ti | w'a(@)a(w)[Di(w, r) dw/27. (85.14) 


—O 


This formula gives the energy of interaction of two atoms at any distance large 
compared with the atomic dimensions a. We have now to find and insert an explicit 
expression for Dx(@, r). 

Comparison of the expressions (76.14) and (76.8) shows that 


Di(w, k) = — (3: = “t)D(o, k), 
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where D(w,k) is given by (76.8). In the w —r representation, the relationship is 
correspondingly 


I 


Do, 6) = — (5: Bae x ae 


=a) Dlw.r). (85.15) 


Substitution of D(w, r) from (76.16), and carrying out the differentiations, gives 


1 1 
Di (o, r) = Ee (1 ‘or at) + 


r of 
xike (3 3b |= 
se (3 7 ae | bree (85.16) 


Then, substituting this expression in (85.14), we find by a simple calculation, 
using the fact that a(w) is even, the final expression for the interaction energy of 
the atoms: 


Ui)= sa | wtaloraxtwye™|1+ 2-5-4 dw. (85.17) 
0 


This general result can be simplified in the limiting cases of “‘small’’ distances 
(a <r <do) and “‘large”’ distances (r > Apo). 

When r <A o, the important values in the integral are (see below) w ~ wo, where 
wo ~ C/Ag are the atomic frequencies, and therefore wr <1. Then only the last term 
in the bracket need be retained, and the exponential may be replaced by unity. 
Writing the integral as one from —~ to © (with a view to the subsequent 
calculations), we find 


oO 


U(r) = — | a)(w)ar(w) dw. (85.18) 


— 


The interaction law at these distances proves to be 1/r®, as it should. The integral in 
(85.18) is easily calculated, after substitution of a(w) from (85.13), by closing the 
contour of integration with an infinite semicircle in the lower half of the complex 
w-plane; the integral is determined from the residues of the integrand at the poles 
W = Wno~ Wo. Assuming (to simplify the result) that the two atoms are identical, we 
find (in ordinary units) 


2 2 
U(r) = = DA don! Idowl (85.19) 


h(wno + @n’ 0) 


the same as the familiar London’s formula (see QM, §89, Problem). 
In the limit of large distances (r > Ao), the important values in the integral are 
w <=c/r<wo; when w 2 qo, the integral is made small by the rapidly oscillating 
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factor exp 2iwr. We can therefore replace the polarizabilities a,(w) and aw) by 
their static values a;(0) and a,(0). The integration is then elementary. (To ensure 
convergence, r in the exponential is to be replaced by r+ i0.) The final result is (in 
ordinary units) 


_ 23 ficas(0)a20) 


Ute) = At r’ 


(85.20) 


(H. B. G. Casimir and D. Polder, 1948). 


+ The derivation given here is due to I. E. Dzyaloshinskii. 


CHAPTER X 


INTERACTION OF ELECTRONS WITH PHOTONS 


$86. Scattering of a photon by an electron 


THE conservation of 4-momentum in the scattering of a photon by a free electron 
(the Compton effect ) is expressed by the equation 


ptk=p'+k', (86.1) 


where p and k are the 4-momenta of the electron and the photon before the 
collision, and p’ and k’ their 4-momenta after the collision. The kinematic in- 
variants defined in §66 are 
s=(pt+ky=(p' +k’? = m?+2pk = m* + 2p'k’, 
t =(p—p') =(k'—k)y = 2m? — pp’) = — 2kk’, 
u=(p —k'? = (p'— ky = m?— 2pk' = m*— 2p'k, 


(86.2) 


sttt+tu=2m’. 


The process in question is represented by the two Feynman diagrams (74.14), 
and its amplitude is 


My = — 47e*e,*e,(i'Q”u), (86.3) 
where 
1 1 
pv pe v v =a f pb 
Q ery | (yp + yk + m)y 1 eee Y (yp — yk'+ m)y". (86.4) 


Here e, e’ are the polarization 4-vectors of the initial and final photons; u, u’ the 
bispinor amplitudes of the initial and final electrons. 

According to the rules given in $65, for arbitrary polarization states of the 
particles |M,;;|’ is replaced by 


[Myil’ > 16777e* tr{p ©” pQ”Q*’ppPQ*}, (86.5) 


where p®, p” are the density matrices of the initial and final electrons, p™, p™ 
those of the photons. The photon (tensor) indices are written explicitly, but the 
electron (bispinor) indices are not. The trace symbol refers to the latter indices, as 
does the superscript plus in the definiton Q,, = y°Q{,y°. 
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Let us consider the scattering of an unpolarized photon by an unpolarized 
electron, without regard to their polarizations after the scattering. The averaging 
with respect to the polarizations of all particles is given by the density matrices: 


(y) 


pv? (y)) — 


= py p= 


— 2 aus (yp +m), p°" = (yp'+m); 


the change to summation over the polarizations of the final particles involves a 
further multiplication by 2 x 2 = 4. 

From formula (64.23), in which we must now put I* = i(s — m’)’ (see (64.15a)), 
we find the cross-section 


do = =e “A trl(y0'+ m)Q**(yp + m) Qu}. 


From (65.2a), Q,, = Qy,. Separating the terms which differ only by the changes 
k <> —k’ (and accordingly s <u), we can put the cross-section in the form 


4 
do = dt a. [f(s, uw) + 9(s,u)+ flu, s) + e(u, s)], 
with the notation 


f(s, u)= iow tr{(yp'+ m)y"(yp + yk + m)y"(yp + m)y,(yp + yk + m)y,}, 


2(s,u) = erm OP’ +m)y"(yp + yk +m)y"(yp + m)y, X 


x (yp — yk'+ m)y,}; 


this notation takes account of the fact that the result will depend only on the 
invariant quantities. 

The summation over p and v is effected by means of formulae (22.6); then, 
omitting terms which contain an odd number of factors y, we obtain 


f(s, u)= omy tr{(yp')(yp + yk)(yp)(yp + yk) + 4m*(yp + yk)(yk — yp')+ 
+ m*(yp)(yp') + 4m 4}. 


The trace is calculated by means of formulae (22.13); expressing all quantities in 
terms of the invariants s and u, we easily obtain 


f(s, u) = Ge 2 Gen. {4m‘*—(s — m’)(u— m’) + 2m*(s — m’)}. 
Similarly, 


2m? 


BO = mum?) 


{4m?+(s —m’)+(u-—m’)}. 
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The cross-section is thus 


m? m? 1/s—-m* u-m 
- (; mu maa a+ :)} ages 


where r, = e?/m. This formula expresses the cross-section in terms of invariant 
quantities, and can easily be used to express it in terms of the collision parameters 
in any specified frame of reference. 
Let us do this for the laboratory system, in which the electron is at rest before 
the collision: p = (m, 0). Here 
s—m’=2ma, u—m’*>=—2mo’. (86.7) 


Squaring the equation of conservation of 4-momentum in the form p+k-—k'=p’, 
we have 


pk — pk'— kk’ =0, 
whence (in the laboratory system) 
m(@ — w')— ww'(1—cos 3) = 0, 


where @ is the angle of scattering of the photon. This equation gives the relation 
between the photon energy change and the scattering angle: 


oe ee 
oi =p (1— cos 8). (86.8) 


The invariant t is 
t = —2kk' = —2wo'(1—cos 9). 
For a given energy w we find, using (86.8), 
dt =2w" dcos 8 =(1/7)w"” do’ (do' = 27 sin 6 d9). 


Substitution of these expressions in (86.6) gives the following formula for the 
scattering cross-section in the laboratory system: 


2 ’ 
ie ir:(2) (S+2- ti 8) do’ (86.9) 


W @ 


(O. Klein and Y. Nishina, 1929; I. E. Tamm, 1930). 
Since the angle 3 is unambiguously related to w’ by (86.8), the cross-section can 
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be expressed in terms of the energy w’ of the scattered photon: 


! , 2 
do = nr? noe \S+o4 (= -") = 2m(---)|, (86.10) 


with w’ varying in the range 
@ ! 6 
1 ] = Sw. (8 . l 1) 


When w <m, we can put w' = w in (86.9), and the result is, as it should be, the 
classical non-relativistic Thomson’s formula 


do =37r2(1+ cos? &) do’; (86.12) 


see Fields, (78.7). 
To calculate the total cross-section, we return to formula (86.6). The invariants 
s, t, u there take values satisfying the inequalities 


s=m’, t <0, us <m*, (86.13) 


These have already been derived in $67; the corresponding physical region is I in 
Fig. 7 (367). They are also easily obtained directly from the expressions for the 
invariants in the centre-of-mass system. Here p+ k= 0, and the energies e of the 
electron and w of the photon are related by ¢ = V(w? +m’). The invariants are 
=(e+tw)l=m’+2o0(w+«), 
u=m’*—2w(e+w cos 8), (86.14) 
t = —2w7(1—cos 6), 
where @ is the scattering angle (the angle between p and p’ or between k and k’). 
The three inequalities (86.13) then result from the conditions #20 and —15 
cos @é <1. 


For a given s (1.e. a given energy of the particles), the integration with respect 
to t can be replaced by one with respect to u = 2m*—s —t over the range 


mis <u<2m’-s. 
Using instead of s and u the quantities 


x =(s—m?’)/m’, y =(m’— u)/m’, (86.15) 


we obtain 


x 


_ 8ar2 Le A Ds Dey 
sate f (Cae 


x/(x+1) 
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and after the elementary integration 


1 4 1 8 | 
o = ar? {(I-2-3 wy los +) +5+2-s7 5a} (86. 16) 


The leading terms in the expansion for x <1 (the non-relativistic case) are 


o = Sar: (1—x). (86.17) 


The first term is the classical Thomson cross-section. In the opposite, ultra- 
relativistic, case (x > 1), the expansion of (86.16) gives 


o =2nr . (log x +). (86.18) 


In the laboratory system, 


x =2ea/m, (86.19) 


so that formulae (86. 16)-(86.18) give immediately the photon energy dependence of 
the cross-section for scattering by an electron at rest. Figure 13 shows o as a 
function of w/m. 

In the ultra-relativistic case, the cross-section decreases with increasing energy 
both in the laboratory system (o « w 'log w) and in the centre-of-mass system 
(x ~4w’/m’, o < w *logw). But the angular distribution in the ultra-relativistic 
case has quite different forms in these two frames of reference. 

In the laboratory system, the differential cross-section has a sharp peak in the 
forward direction. In a narrow cone 3 < V(m/o) we have w’~w and the cross- 


8= 2 


QW 


001 002 0050102 05 1 2 5 10 20 50 100 


w/m 


Fic. 13. 
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section da/do’~ rz, reaching the value r2 as 8-0. Outside this cone, the cross- 
section decreases, and in the range 0*> m/w (where w’~ m/(1—cos 3)) we have 


da 


do' J 


*"©w(1—cos 8) 
i.e. the cross-section is reduced by a factor ~w/m. 
In the centre-of-mass system, on the other hand, the differential cross-section 
has a peak in the backward direction. For 7 — @ <1 we have from (86.14) 


s—m* 4’ m*—u aw? ‘ 
Fe we BO ~14+25(9-6y. 
29 m2 ek ) 


The largest term in the cross-section (86.6) is 


2 
is 5 m* dt 
do =~ 8nr~ AG OG ay 


whence 


do’ 


a ND 
aor ake 1+ (a — 0) w*/m* 


(86.20) 


The cross-section da/do' ~ r2 in a narrow cone 7 — 6 < m/w; outside this cone it is 
reduced by a factor of the order of ~w/m’. 


§87. Scattering of a photon by an electron. Polarization effects 


We shall now go back to the original formulae of 886 and show how the 
calculations must be made in order to take account of the polarization of the initial 
and final photons and electrons. 

The density matrix of the photon can be expressed, according to (8.17), by 
means of a pair of unit 4-vectors e“”, e® which satisfy the conditions (8.16). In the 
present case, these vectors can be taken to be, for both photons, the 4-vectors 
defined in 8707 


eV—=NIV(-N’), =e = P/V (—P?), (87.1) 
where 
P* =(p* + p")— K*(pK + p'K)IK’, 
N* = e“”P.¢.K,, 
K* =k* +k”, 
q’* =— Geen as = p*—p". 


(87.2) 


+ An alternative procedure is to consider from the start a specified frame of reference (say the 
laboratory system) and take for each photon as e“”, e” purely spatial unit vectors e = (0,e) which are 
orthogonal to the photon momenta and to each other. In that case, however, the calculations will be 
entirely in three-dimensional form, and the result will not be invariant. 
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The quantities Q”” in (86.5) are given by (86.4). They may be regarded as 
components of a 4-tensor (in the sense that they form a 4-tensor after being 
contracted with spinors as the quantities #’Q”’u). All the components of a 4-tensor 
can be obtained by projecting it on four mutually orthogonal 4-vectors, for instance 
on P, N, q and K defined above. Since the tensors p\?), p°? contain only com- 
ponents along P and N, we need in fact only the components of Q,, along these 
4-vectors. In other words, it is sufficient to find in Q,, the terms of the form 


Qui = Qllewe, Her ey) + Qieve + eve) = 
IONE? ene) Ore, = ere,): (87.3) 
the remaining terms would disappear on substitution in (86.5). The quantities Qo 
and Q,; are scalars in the same sense that Q,,, is a 4-tensor; they therefore contain 


the matrices y only in the “invariant” combinations yK, etc. In the same sense, Q, 
and Q, are pseudoscalars (N is a pseudovector), and hence must contain the matrix 


5 
y e 
By direct projection of the tensor Q,, we find 


=i (1 2) (2 
0= 10 (ee + eer), 


etc. In the calculation it is convenient first to express Q,, in terms of the mutually 
orthogonal 4-vectors P, N, q, K: 


Qe = yey +m, yayP +m 


1 
| ae (u v_ v Bb 
Hee Foy gh EY yy). 


There then remain some purely algebraic calculations using the formulae given 
in §22. It is also possible to make changes in Q”’ which do not affect the result 
after the subsequent construction of the product #’Q”’u. For instance, since 


i'(yp + yp')u = 2mi'u, 
ii'y°(yq)u = a'(y’(yp) + (yp')y°)u = 2mit'y°u, 


we can make in Q”” the changes 


yp +yp'>2m, —-y°(yq)>2my’. (87.4) 
The detailed calculations are omitted here; the final result ist 


Qo = —mai., Q, a sia, (yK), 


(87.5) 
Q,.=-ma+y,  Q3;=ma,+a_(yK), 


t The expression (87.3) with the values (87.5) corresponds to the formulae (70.11)-(70.13) derived in 
870 from general considerations. Besides the equations f3;=fs=0 which follow from T invariance, 
another invariant amplitude (f2) is here zero also. This is a property of the approximation of perturbation 
theory used here, and would not occur in higher approximations. 
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where 


In the subsequent calculations, it is convenient to apply to Q,, the same formal 
treatment as has been described in §8 for the photon density matrix: the four 
components of the tensor (87.3) in the directions e‘”, e® are combined to form a 
two-rowed matrix Q which is then expanded in terms of Pauli matrices. Similarly 
to (8.18), we obtain 


Q=Q+Q°-a, Q = (Qi, Qo, Qs). (87.6) 


The components of the tensor Q. = y°Qi_y° in (86.5) are easily seen from (87.3), 
(87.5) and the rules (65.2a) to be obtained from those of Q,, on replacing Qo, Qi,... 
by Qo, Qi,..., where 


Qo = Qo, Q, = 7, Q, ==), Qs = Q;, (87.7) 


and simultaneously interchanging the indices p, v.t In matrix form, 
O=Q+Q°-4. (87.8) 


Let us now define more precisely the sense of the 4-vectors e“”, e® in relation 
to the polarization of the photons. For each photon, the independent directions of 
polarization will be determined by the components of the 3-vectors e‘”, e” trans- 
verse to the photon momentum k.+# It is easily seen that, in both the centre-of-mass 
system and the laboratory system (in which the initial electron 1s at rest), the vector 
P is in the plane of k and k’, and N perpendicular to that plane. The direction e"” is 
therefore that of the polarization perpendicular to the plane of scattering, and e” is 
that of the polarization in the plane of scattering. It must also be noted that the 
Stokes parameters &, &, € are defined with respect to the axes xyz, which form a 
right-handed set with the z-axis in the direction of k. It is easily seen that for the 
initial photon the vectors N, P,, k form such a set, and for the final photon the 
vectors N, —P/, k’ (where P, and P; are the components of P perpendicular to k and 
k’ respectively). A change of sign e” in the photon density matrix (8.17) is 
equivalent to a change of sign of €, and &. The density matrices of the initial and 
final photons, referred to the unit 4-vectors e“” and e, are therefore 


p? =7(1 ae a), E= (&1, £), &3); 


je, (87.9) 
pM=11+€&-a), & =(-E1, —&, &3). 


+ For the matrix Q,, in the original form (86.4) we should have simply Q,. = Q,,. This property. 
however, is lost as a result of transformations such as (87.4). 

+ The longitudinal components of e, like the time components of the 4-vectors e, can here be simply 
ignored; this is permissible, owing to gauge invariance. 
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y OM y Q 
Pp ve P Y 


is now calculated as the trace of the matrix product of the matrices (87.6)—(87.9), 
using (33.5). The result is 


|Myil’ = 8277 e4 tr{(p Qop Qo + p"Q + pQ) + 
+ (E+ €) - (9p QopQ + p©’Qp'Qo) — iE — E) - p'Q x p©Qt 
+ (E+ Ep QopQo — p"Q + p°Q)+ 
+ pO + Q)pOCE + Q) + pO"E DoE - Q)- 
— iE X E+ (p’Qop'Q — p’Qp Qo}. (87.10) 


SCATTERING BY UNPOLARIZED ELECTRONS 


We shall complete the calculation of the cross-section for the scattering of 
polarized photons by an unpolarized electron, summed over polarizations of the 
final electron. To do so, we must put in (87.10) 


p=x(yp +m), — p©" =2(yp' + m), 


double the result, and substitute it in place of |M,|’ in the formula (64.22) for the 
cross-section: 


1 dtdd_ 


do = 37 (s— my? 


Mil’, 


where ¢ is the azimuth in the centre-of-mass or laboratory system. Some of the 
terms in (87.10) are identically zero; the calculation of the other terms gives the 
final result (with the notation (86.15)) 


ower ttlornf- G4) -E-s)) 


selfs HEeNoE 3) 
iF ee[(t-2) + (5-5) +5}. (87.11) 


Here do is the scattering cross-section for unpolarized photons given by (86.9); the 
factor 3 appears because there is no summation over the polarizations of the final 
photon in (87.11). 
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In the laboratory system, formula (87.11) becomes 


2 
do = ire(2] do'{Fo + F3(&3 + &3) + Furé:€) + Frd265 + Fs3€3€3}, 
do' = sin 3 dd dd, (87.12) 


where 


yaa he Sin 0, F, = sin’ 0, 
: (87.13) 
F;, =2 cos 9, Fy = (2+ 2) cos 9, F3=1+cos? 3 


(U. Fano, 1949). Although (87.12) shows no explicit dependence on the azimuth ¢ 
of the scattering plane, there is an implicit dependence, since the parameters &,, &, 
€; are defined with respect to the axes xyz, which are fixed to the scattering plane. 
The x-axis is the same for both photons and perpendicular to the scattering plane: 


x \|k xk’, 
and the y-axes are in that plane: 
y|[kx(kxk’), —-y'[[k’x k xk’). 


Taking the sum of cross-sections differing in the sign of &' (i.e. putting &' = 0 and 
doubling the result), we obtain the total cross-section (summed over polarizations 
of the final photon) for scattering of a polarized photon by an unpolarized electron. 
Denoting this cross-section by da(&), we have 


do(€) = 397(@'|w)F do’, (87.14) 


where 
Pelt ois= ae =o) sin? 9. (87.15) 


We see that the scattering cross-section for photons polarized perpendicular to the 
scattering plane (€; = 1) is greater than that for photons polarized in the scattering 
plane (é; = — 1). The cross-section is independent of circular polarization and of the 
parameter &,. The scattering cross-section is therefore equal to that for unpolarized 
photons if there is no linear polarization relative to the x and y axes (&; = 0) or even 
if there is polarization relative to directions at 45° to these axes. 

The cross-section for scattering of unpolarized photons with detection of a 
polarized photon has similar properties. This cross-section, which we denote by 
do (&'), is obtained from (87.12) by putting & = 0: 


do(€') = 4r(w'lw)F' do’,  F'= Fo+ &4F3. (87.16) 
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From formula (87.12) it is also possible to deduce the polarization of the 
secondary photon itself; we shall denote the parameters of this polarization by &” 
to distinguish them from the detected polarization &’. According to the rules given 
in §65, the quantities €{ are equal to the ratios of the coefficients of the £/ to the 
term independent of &: 


EY? = (Fiui/F)é, ES? = (Fy/F)&, EY) a (F3 + F33&3)/F. (87.17) 
In particular, for the scattering of an unpolarized photon 


2 
)_ sin’ 0 
g wlw'+ w'/w — sin’? 0 


éf) = éf =0, (87.18) 


Here €¥) >0, i.e. the secondary photon is polarized perpendicular to the scattering 
plane. Circular polarization of the secondary photon occurs only if the primary 
photon is circularly polarized: é¥ # 0 only if &, #0. 

Let us consider the case of complete linear polarization of the incident photon 
(€.=0, {+ €3=1), and find the cross-section for scattering with detection of a 
linearly polarized secondary photon. Expressing the parameters & and &€; in terms 
of the components of the photon polarization vectors e and e’, we obtain the 
following expression for the scattering cross-section: 


nr 2 , 
= ire(<) (2+ 2-2+4c0s?@) doe (87.19) 
@ @ @ 


where © is the angle between the directions of polarization of the incident and 
scattered photons.t 

According to this formula, the cross-section behaves quite differently when the 
polarizations e and e’ are perpendicular and when they are parallel. Distinguishing 
these two cases by the suffixes 1 and ||, we have in the non-relativistic limit (# <m, 
w' = w) 


do, = 0, do\ = r? cos’ @ do’, (87.20) 


in agreement with the classical formulae. In the opposite, ultra-relativistic, case we 
have w>m, w'’=m/(1—cos 0). Here the two ranges of large and small angles 
(large and small w/w’) must be distinguished: 


m do’ 
w(1—cos 0) 


da,=0, doy=recos’@do’ for 0’ <mlo. 


, 
1.2 1.2 2 ‘ 
do, = doy= are do! = are for 0° >m/o; 


(87.21) 


+ Formula (87.19) itself could be more simply derived by writing from the start e = (0,e), e’ = (0, e’) 
in the scattering amplitude (86.3) and continuing the calculation of the squared amplitude in three- 
dimensional form (i.e. separating the time and space components of the 4-vectors). 

On averaging cos’ @=(e-e’) over the directions of e and e’ (using (45.4a)), and doubling the 
cross-section (to sum over e’), we of course return to (86.9). 
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We see that the scattering cross-section has its classical value at very small angles. 
The approximate equality of do, and do, at angles which are not very small 
signifies that in this range, in the ultra-relativistic case, the scattered radiation is 
unpolarized; but it must be emphasized that this conclusion applies specifically to a 
linearly polarized incident photon. From (87.17) it is evident that, for a circularly 
polarized photon in the ultra-relativistic case, €Y) > & - cos #. 


SCATTERING BY POLARIZED ELECTRONS 


For polarized electrons, the calculation of the traces in formula (87.10) becomes 
very laborious, though not difficult in principle. Here we shall give only some of the 
final results of the calculation.t 

In general, the cross-section depends both on the polarization parameters &€ and 
€’ of the initial and final photons, and on the polarizations of the initial and final 
electrons, described by vectors ¢ and CG’. The dependence on each of these 
parameters is linear. The cross-section has the form 


do =tdo (€, &') + sre(w'lw)” dof -Cé+f Ces +g-Cé te CE + Grlilit---}, 
(87.22) 


where doa(&, &’) is the cross-section (87.12). All the terms which contain products of 
two polarization parameters have been written out in (87.22). Terms containing 
products of three or four parameters have been omitted; they are unimportant as 
regards correlations between the polarizations of only two particles, and disappear 
when the polarization parameters of the other two particles are equated to zero. The 
following are the values of some of the coefficients in the laboratory system: 


{= --q —cos 0)(k cos 0 +k’), 


f= 4 (1—cos 3)(k+k’' cos 8), 


1 w + @’ (87.23) 
g=—— (1—cos 9)| (k cos 9 +k’)— (1+ cos aa ro FS kk], 
ae oe ‘ _ wot+o’ AG 
ees (1—cos 9)|(k+k cos 0)—(1+ cos Oo) on a (k k),|. 


The cross-section (87.22) contains no term of the form G- ¢. This signifies that 
the polarization of the electron does not affect the total cross-section (summed 
over &' and @’) for the scattering of unpolarized photons. There is also no term of 
the form G’-7@’. This signifies that, in the scattering of unpolarized photons, the 
recoil electron is unpolarized. 

We see also that the terms bilinear in the polarizations of the electron and 
photon contain only the parameters € and &3; which correspond to circular 


+ Further details may be found in the review articles by H. A. Tolhoek, Reviews of Modern Physics 
28, 277, 1956; W. H. McMaster, ibid. 33, 8, 1961. 
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polarization of the photon. The polarization vectors ¢ and @’ of the electrons appear 
in the form of scalar products f - ¢, etc., which contain only the projections of these 
vectors on the scattering plane. Hence, for example, the cross-section for scatter- 
ing of a polarized photon by a polarized electron, 


do(&, ©) = do(€) + 3r2(w'lw) Ef + C do’, (87.24) 


differs from do(&) only in that the photon is circularly polarized and the electrons 
have a non-zero projection of the mean spin on the scattering plane. For the same 
reason, the recoil electron is polarized only if the photon is circularly polarized; the 
resulting electron polarization vector is then in the scattering plane: 


C° = &¢/F. (87.25) 


SYMMETRY RELATIONS 


Finally, we shall show that the qualitative properties of the polarization effects 
in the scattering of photons by electrons follow from the general requirements of 
symmetry. 

The parameter &€, of circular polarization is a pseudoscalar (see §8). Hence, from 
the requirement of P invariance, terms «é&, (or « 3) in the scattering cross-section 
could occur only as the product of & with some pseudoscalar formed from the 
available vectors k and k’.f But a pseudoscalar cannot be formed from two polar 
vectors. It therefore follows that no such terms can appear in the cross-section. 

The parameters €; and &; of linear polarization are related to the components of 
the two-dimensional (in a plane perpendicular to k) symmetric tensor 


] 
Sap = Apo + pee) 


Hs ( - ary 


In the present case, one of the polarization axes 1s taken to lie along the vector 
v =k Xk’, and the other lies in the plane of k and k’ (along k X v for one photon and 
along k’ xv for the other). Terms «€, could occur in the cross-section only as 
products S,gv.(k’ X v), (or, equivalently, S.gv.k,), etc. But, since v is an axial vector, k 
a polar vector, and S,, a true tensor, such products are not invariant with respect to 
inversion. There are therefore also no terms & €; (or « &}) in the cross-section. Terms 
x &€, (or <&3), however, occur as products S,gv.vg, etc., and are not forbidden by 
considerations of symmetry. 

Terms in the cross-section that are proportional to the electron polarization ¢ 
are not forbidden by parity: such terms could arise from the products €-v of two 
axial vectors. They must, however, be absent in the first non-vanishing ap- 
proximation of perturbation theory, here considered, because the scattering matrix 


+ We are considering the process in the laboratory system, where p = 0, p’ = k—k’. It is evident that 
the relevant consequences of the symmetry requirements (the presence or absence of particular terms in 
the cross-section) will not depend on the choice of the frame of reference. 
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is Hermitian in that approximation (§71). Owing to this property, the square of the 
scattering amplitude (and therefore the cross-section) is unchanged when the initial 
and final states are interchanged. At the same time the cross-section must be 
invariant under time reversal, i.e. interchange of the initial and final states together 
with a change of sign of the momentum and angular momentum vectors of all the 
particles; the Stokes parameters &), &, &; are then unaltered (see 88). On combining 
these two requirements, we find that in the approximation considered the cross- 
section must be unchanged by a change of sign of all the momenta and angular 
momenta without interchange of the initial and final states, i.e. by the trans- 
formation 


k>-k, k>-k, (5-4 ¢>-0 (87.26) 


with € and é’ unaltered. 

The transformation (87.26) changes the sign of the product ¢- wv, and such terms 
therefore cannot appear in the cross-section. It must be emphasized, however, that 
this prohibition is not a consequence of strict requirements of symmetry, and may 
therefore no longer apply in higher approximations of perturbation theory. 

Among the terms of the binary correlation between the polarizations of the 
photons, only those of the form €,&; and &,€; are forbidden by parity, and none of 
those of the photon—electron correlation are forbidden. But all terms of the form 
E:é, & 0, & are forbidden in the first approximation by the requirement of 
invariance under the transformation (87.26). For instance, terms of the form &,é’, 
and &,¢ could be formed (so far as parity is concerned) as scalars such as &3S.gk iv, 
and Syskivg6-k, but such combinations change sign under the transformation 
(87.26). 

The allowed correlation terms of'the form &¢ can be formed as products of the 
type &.¢-k. The electron polarization vectors appear in them only as projections on 
the scattering plane. 

Finally, a number of relations between the coefficients in the allowed terms 
result from the requirements of crossing symmetry. Reaction channels which differ 
by an interchange of initial and final photons correspond to the same process— 
scattering of a photon by an electron. The squared modulus of the amplitude, and 
therefore the scattering cross-section, must consequently be invariant under a 
transformation which expresses the change from one of these channels to the 
other: 


k ok’, e<e'* 


with the electron momenta and polarizations unchanged. In three-dimensional 
form, this transformation is 


wOO-w’, ko —k’, 


ier, 2 — G2, Fs &. 


(87.27) 


The change in the sign of & is evident from the expression &) = ie X e* -n, in which 
the vector eX e* changes sign when e and e* are interchanged, while the vector 
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n=k/w is unchanged when k@—k, w @— w. The transformation (87.27) does not 
affect the electron momenta and therefore leaves the laboratory system unaltered. 
Hence the cross-section (87.22) cannot change its form under this transformation, 
and in fact the formulae (87.12), (87.22), (87.23) comply with this requirement. 


$88. Two-photon annihilation of an electron pair 


the annihilation of an electron and a positron (with 4-momenta p-_ and p,) to 
form two photons (k; and k2) corresponds to two diagrams 


k, oot Seg Sm p_ k, —— p_ 
| | (88.1) 
Kaa “Pp, Ki eas oe —p, 
These differ from the diagrams for scattering of a photon by an electron as follows: 


p> p-, Peps k>—-k, k' > kp. (88.2) 


The two processes are two cross-channels of the same (generalized) reaction. After 
the changes (88.2), the kinematic invariants (86.2) become 


s =(p_—k,)’, 
t=(p_+p,)?= (ki + ky)’, (88.3) 
u = (p- k,)’. 


If the photon scattering is the s channel, then the annihilation is the t channel. 

The quantity |M,|’ for annihilation (averaged over polarizations of the electrons 
and summed over those of the photons), when expressed in terms of the invariants 
s and u, is the same as corresponding quantity for scattering, only the meaning of 
the invariants being changed.t In the formula (64.23) for the cross-section, the 
change st is needed in the coefficients of |M,|’, and I’ is now, according to 
(64.15a), equal to 4t(t — 4m”). Making the appropriate alterations in formula (86.6), 
we find the annihilation cross-section 


2 2 2 Z 
= » mds m m 
do = Barts am ema me) * 


m? m? 1/s-m* u-m 
Ht ee) ead 


The physical region of the annihilation channel is region II in Fig. 7 ($67). For 
given t (given energy in the centre-of-mass system), the range of variation of s 1s 


+ This takes account of the fact that the photons and the electrons have the same number of 
independent polarizations (two), and it is therefore immaterial which correspond to the averaging of 
|M;i| and which to the summation. 
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determined by the equation of the boundary su = m*. Together with the relation 
s+tt+u=2m’, this gives 


—3t —3V [t(t —4m’)] < 5s — m? <—3t +3V[t(t —4m?)]. (88.5) 


The integration of (88.4) is elementary; the result must be divided by two to 
take account of the identity of the two final particles (the photons). Thus we have 


o= me ay {Ge + 7-34) log ae ae —(t+1I)V[t(r - ni}, (88.6) 


where + =4t/m? (P. A. M. Dirac, 1930). 
In the non-relativistic limit (t > 1), this gives 


o = 5arr2|V (7 — 1). (88.7) 
In the ultra-relativistic case (7 > &), 
o = Grr2/r)(log 47 — 1). (88.8) 


In the laboratory system, in which one particle (say the electron) is at rest 
before the collision, the invariant 7 is 


ee ee (88.9) 


Formulae (88.6)—(88.8) give as the dependence of the total cross-section on the 
energy of the incident positron 


_ ome (fy? +4y +1 2 Vi 
gare a logly + V(y 1)] Viy7-1 ' (88.10) 


In particular, in the non-relativistic limitt 
o =r./v, (non-relativistic), (88.11) 


where v, is the velocity of the positron. 
In the centre-of-mass system the electron, the positron and the two photons have 
equal energies, ¢ = w. The invariants are 


m’— s =2e(e — |p| cos 0), 
m’—u = 2e(e + |p| cos 8), (88.12) 
t=4e’, 


where @ is the angle between the momentum of the electron and that of one of the 


+ This formula becomes inapplicable, however, when v+<a and the Coulomb interaction of the 
components of the pair cannot be neglected; cf. the end of §94. 
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photons. Substituting in (88.4), we find the angular distribution of the annihilation 
photons: 


tae rem? | p(i+sin’ 6) __2p*sin’ 6 | A (88.13) 


~ 4elp]L e’-—p’cos’@ = (e”— pcos’ 6) 
In the ultra-relativistic case this has symmetrical maxima in the directions 0 =0 
and 6 = 7. Near 6 = 0, we have 


r2m? do 


dao =~ 2640? + me (ultra-relativistic). (88.14) 


The total cross-section is obtained from (88.6): 


weer te: a4 
o = ar E = logg > — 22- v9), (88.15) 


where v = |p|/e = V(e’— m’)/e is the velocity of the colliding particles. 

We shall not discuss here the details of the polarization effects in annihilation,? 
but merely consider certain qualitative features of these effects in the limiting cases 
where the velocity v of the colliding particles is large or small. The process will be 
considered in the centre-of-mass system. 

In the limit v >0, only the state with orbital angular momentum of relative 
motion | = 0 gives a non-zero contribution to the cross-section. But the S state of 
the electron + positron system has negative parity ($27, Problem). In odd states of a 
two-photon system, their polarizations are orthogonal ($9). The same must there- 
fore be true of the annihilation photons in the non-relativistic case. 

If the electron and positron are polarized, their annihilation is possible (again in 
the non-relativistic case) only if their spins are antiparallel: since the annihilation 
occurs in the S state, the total angular momentum of the system is equal to the 
total spin of the particles, which is 1 when the spins are parallel. The two-photon 
system, however, has no state with total angular momentum I (see $9). 

In the ultra-relativistic limit (v > 1), the annihilation of a longitudinally polarized 
(helical) electron and positron is possible only when their helicities have opposite 
signs.+ In this limit, helical particles behave as neutrinos (see the end of §80), and 
the electron and positron undergoing annihilation must be analogous to a neutrino 
and an antineutrino, whence the result stated follows. 

The annihilation of an electron and a positron with the same helicity occurs, in 
the ultra-relativistic case, only when terms containing m are taken into account. The 
amplitude of this process differs, in order of magnitude, by a factor m/e from that 
of the annihilation of a pair with parallel spins; the cross-section accordingly differs 
by a factor (m/e). 

+t See W. H. McMaster, Reviews of Modern Physics 33, 8, 1961. 


t+ Since the directions of the particle momenta are also opposite (in the centre-of-mass system), 
helicities of opposite sign correspond to parallel spins. 
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PROBLEM 


Find the cross-section for the formation of an electron pair in the collision of two photons (G. Breit 
and J. A. Wheeler, 1934). 


SOLUTION. This is the process inverse to the two-photon annihilation of an electron pair. The 
squared amplitudes are the same for the two processes, and their relationship to the cross-section differs 
only in that here I? = (k,k,)’ = 4t”. Hence 


t— 4m?’ 
dG torm = doann t 


In the centre-of-mass system (t = 4c’ = 4”), 

do torm = v’ Gains 
where v is the velocity of the components of the pair. In integrating to obtain the total cross-section, the 
result is not to be divided by 2 (as in the case of annihilation), because the two final particles (electron 


and positron) are not identical. Hence, in the centre-of-mass system, 


ay) 2 
Oform — £U Gann 


= bari - »?)}3 v4) log 


pe 202- v9} (1) 


In an arbitrary frame of reference K, in which the two photons k; and k2 are moving in opposite 
directions, we have (from the invariance of kk2) 


2 
W1@2—- @ , 


where w is the energy of the photons in the centre-of-mass system. Since this energy is equal to that of 
the pair components, we have w = ¢ = m/V(1— v’). To change to the frame K, we must therefore put in 


(1) 
v= V4 = m?/@1@2). 


§89. Annihilation of positronium 


Owing to the conservation of momentum, the annihilation of the electron and 
positron in positronium must be accompanied by the emission of at least two 
photons. Such a decay is possible (in the ground state), however, only for 
parapositronium. In §9 we have shown that the total angular momentum of a 
two-photon system cannot be 1. Hence orthopositronium in the *S, state cannot 
decay into two photons. Moreover, since positronium in the *S, state is a charge- 
odd system (see §27, Problem), Furry’s theorem (879) shows that it cannot decay 
into any even number of photons. In the ‘Sy state, on the other hand, positronium is 
charge-even, and the decay of parapositronium into any odd number of photons is 
therefore forbidden. 

The main process which determines the lifetime of positronium is therefore 
two-photon annihilation for parapositronium and three-photon annihilation for 
orthopositronium (I. Ya. Pomeranchuk, 1948). The decay probability can be related 
to the cross-section for annihilation of a free pair. 

The electron and positron momenta in positronium are ~me?’/h, i.e. small 
compared with mc. Hence, in calculating the probability of annihilation, we can 
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take the limit of two particles at rest at the origin. Let @,, be the cross-section for 
two-photon annihilation of a free pair, averaged over the spin directions of both 
particles. In the non-relativistic limit, according to (88.11), 


G2, = 1(e’*/mc’)’c/v, (89.1) 


where v is the relative velocity of the particles. The annihilation probability w2, is 
obtained on multiplying a), by the flux density v|W(0)|’. Here y(r) is the wave 
function, normalized to unity, of the positronium ground state: 


W(r) = Torah ere a = 2h?/me’; (89.2) 


the Bohr radius a for positronium is twice that for the hydrogen atom, because its 
reduced mass is half as great. This probability, however, corresponds to the initial 
state averaged over spins, whereas in positronium, of the four possible spin states 
of a two-particle system, only one (with total spin 0) can undergo two-photon 
annihilation. Hence the mean decay probability w., is related to the paraposi- 
tronium decay probability wo by w2, = iWo, and so 


Wo = 4(O)[(vd2y) v0. (89.3) 


Substituting the values from (89.1), (89.2), we obtain for the lifetime of paraposi- 
tronium 


7) = 2hlmc’a? = 1.23 x 10°" sec. (89.4) 
It should be noticed that the level width Ip) = h/t) is small compared with the 
level energy 
|E.| = me*/4h? = mc’a*/4. 
For this reason positronium may be regarded as a system in a quasi-stationary 
State. 


Similarly we find that the decay probability for orthopositronium is related to 
the spin-averaged cross-section for three-photon annihilation of a free pair by 


W,> 33, = 31U6(0)|"( Vey )v0, (89.5) 


the statistical weight of a state with spin 1 being 3. Anticipating, we may mention 
that 


_ A? -9 22 
63, = AEE 6 (4). (89.6) 


The lifetime of orthopositronium is therefore 


On 


h E 
> Ur —9 mera® = 1.4x 107’ sec. (89.7) 


+ Formulae (89.1)-(89.7) are written in ordinary units. 
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The inequality [,<|E,,| is here, of course, satisfied even more markedly than for 
parapositronium. 

Let us now calculate the cross-section for three-photon annihilation of a free 
pair (A. Ore and J. L. Powell, 1949). According to (64.18), the cross-section in the 
centre-of-mass system is expressed in terms of the squared amplitude by 


d°k, d°k, d*k; 
(27)?2w, - 202-203’ 
(89.8) 


4 2 
do, = ee (kK; +k, + k3)d(@ + @2 + W3—- 2m) 


where, according to (64.16), I=2m-5mv=m7’v, v being the relative velocity 
(assumed small) of the positron and the electron; k;, k2, k3 and w;, w2, w3 are the 
wave vectors and frequencies of the photons formed; the delta functions express 
the laws of conservation of energy and momentum. Because of these laws, the 
three frequencies w,, w2, w; must be represented by the lengths of the sides of a 
triangle with perimeter 2m. Thus the magnitudes of the momenta k,, k, k; and the 
angles between them are entirely determined by specifying two frequencies. 
The three-photon annihilation corresponds to the diagram 


k, ec eae Si i p_ 
k» ep Se ety 
k3 Se eC a ~P, 


and a further five diagrams obtained from it by interchanging the photons k,, k», 
k;. The amplitude may be written 


My = (477)*2e0*e* ea (— p.Q™MU(p-), (89.9) 
where 
Q'‘ = 2s y*G(k3 — px)y"G(p_— ky’, (89.10) 


the sum being taken over all interchanges of the photon numbers 1, 2, 3 together 
with corresponding simultaneous interchanges of the tensor indices A, p, v. The 
squared modulus of the amplitude, averaged over the polarizations of the electron 
and the positron and summed over those of the photons, is 


4 [My = (427) tr{p.Q”””p-Quus}, (89.11) 


polar. 


where 
p-=3(yp-+m), p+ = 3(yps —- m). 


The matrices Q™” differ from the matrices Q*”” in that the order of the factors is 
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reversed in each term of the sum. In the limiting case considered, where the 
electron and positron velocities are small, their 3-momenta p_ and p, may be taken 
as zero, putting p- = p, =(m,0). Then the electron Green’s functions are 


py Pa ykitm yk tm’ +) 
GPK) = yam Ime, 


etc., and the density matrices reduce to 
p= =2m(y° + 1). 


A large number of terms arise on carrying out the multiplication in (89.11), but 
the number that need to be calculated can be greatly reduced by making full use of 
the symmetry with respect to interchanges of photons. For example, it is sufficient 
to multiply out the six terms in Q™” (89.10) each with only one term in Og In the 
six traces then remaining, we can again select certain parts which are transformed 
into one another by various interchanges of photons. The products of the 4-vectors 
p, k1, kx, kz; which occur when the traces are expanded can all be expressed in terms 
of the frequencies @), w2, 3. Since p=(m,0), we have pk,;=ma,,.... The 
products k,k,,... are determined from the equation of conservation of 4-momen- 
tum: 2p = k,+k,+k3; for example, writing this equation in the form 2p —k;= 
k, +k, and squaring, we have 


kik, a 2m(m a W3), ee (89.12) 


The result of the calculation, which is still fairly lengthy, is 


1S Mif-=Gny'et- sf (™) + () + (“aa) | 


polar. W2W3 W 1 W3 W1W?2 


Substituting this expression in (89.8), we obtain the differential cross-section for 
three-photon annihilation: 


s ne m—w}\’ me) Mm — w3 ] 
ta = in v ( W203 a (103 e W1W2 ) ‘ 
d?’k, d*k, d’k; 


X 8(K; + ky + k3)5(@ + @2 + @3— 2m) 10203 


(89.13) 


The delta functions have still to be eliminated. The first is removed by 
integrating over d°k3, and we then write 


d?k, d°k, > 42rw? dw, - 2773 d(cos 942) da, 


where 6,2 is the angle between k, and k); it is assumed that the integration has 
already been performed over the directions of k,; and the azimuth of k, relative to 
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k,. Differentiating the equation 
03 = V(wit 03+ 20102 C08 Oy), 
we find 
d COS 017 = (@3/@1 2) dw3. 


The second delta function is removed by integrating over dw3. The resulting 
cross-section for annihilation with formation of photons having specified energies 
iS 


6 os 2 ne z as 2 
dis, = 59 |(™—2) + (— 2) + (BON) } dans dur; (89.14) 


W1W2 W1W3 W2W3 


the factor 1/6 has been included in order to take account of the identity of the 
photons in the subsequent integration over frequencies (cf. the third footnote to 
$64). 

Each of the frequencies w;, w2, w3; can take values between 0 and m; the latter 
can be reached by two frequencies when the third is zero. For given a, the 
frequency w, varies between m—w, and m. Integrating (89.14) over dw, between 
these limits, we obtain the spectral distribution of decay photons: 


dos, = (8e°/3um?)F(w,) dar, 


_ o(m — @4) moor Peeled _ 2mm iT m— aor 


Ov = Oma a 0} On a) 


The function F(;) increases monotonically from zero when w, = 0 to unity when 
w;, = m, and is shown graphically in Fig. 14. 
The total annihilation cross-section is obtained by integrating (89.14) over both 
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frequencies: 


m 


m 
= 4e°® (w,+w2.—-m) 

Oi = Baa 3 wee dw, da». 
0 M—Wy 


The value of the integral is (7? — 9)/3, and we thus return to formula (89.6). 


§90. Synchrotron radiation 


According to the classical theory (Fields, 874), an ultra-relativistic electron 
moving in a constant magnetic field H emits a quasi-continuous spectrum with a 
maximum at the frequency 


w ~ wolelmy, (90.1) 
where 
wy = v|e|H/|p| 
=~ |e|H]/e (90.2) 


is the frequency of revolution of an electron having energy « in a circular orbit (in 
a plane perpendicular to the field).t We shall assume that the longitudinal velocity 
of the electron (parallel to H) is zero, as can always be achieved by a suitable choice 
of the frame of reference. 

Quantum effects in synchrotron radiation originate in two ways: from the 
quantization of the motion of the electron, and from the quantum recoil when a 
photon is emitted. The latter is determined by the ratio hw/e, and this must be 
small if the classical theory is applicable. It is therefore convenient to use the 
parameter 


_HAhpl_ He hwo (£.) 
x Ho m Hom E ; (70:3) 


where Hy = m?/|e|h (= m’c?/|e|h) = 4.4 x 10° G. In the classical case, y ~ hw/e <1. 
In the opposite limit (vy > 1), the energy of the emitted photon hw ~ e, and (as we 
shall see below) the significant region of the spectrum extends to frequencies at 
which the electron energy after the emission is 


e'~ mH)/H. (90.4) 
If the electron remains ultra-relativistic, the field must satisfy the condition 
H/H, <1. (90.5) 


The quantization of the electron motion itself is expressed by the ratio hwo/e; 
hwo is the interval between adjacent energy levels for motion in a magnetic field. 


+ In this section we shall put c = 1 but retain factors of h. 
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Since 


hwo/e = (H/Hy)(m!/e)’, 


it follows from (90.5) that hwo < «, i.e. the motion of the electron is quasi-classical 
for all values of y. That 1s, the non-commutativity between the operators of 
dynamical variables of the electron (quantities of order hwy/e) may be neglected, 
while the non-commutativity of these operators with those of the photon field 
(quantities of order haw/e) is not neglected.t 

The quasi-classical wave functions of stationary states of an electron in an 
external field can be put in the symbolic form 


1 


a\ ,—(i/n)Ht 
VOH) u(p) e (r), (90.6) 


w= 


where @(r) ~ exp(iS/h) are the quasi-classical wave functions of a spinless particle 
(S(r) being its classical action); u(p) is the operator bispinor 


_— ([V(H+m)w 
u(p) = ; 
—_—_-__(¢:p)w 
V(H +m) P) 
obtained from the bispinor plane-wave amplitude u(p) (23.9) on replacing p and ¢ 
by the operatorst 


p=P-eA=—ihV-eA, H=V(p'+m)), 


where P is the generalized momentum of the particle in a field with vector potential 
A(r). The order of the operator factors in w is immaterial, since their non- 
commutativity is neglected, and the spin state of the electron is determined by the 
three-dimensional spinor w. 

In order to calculate the probability of photon emission in the quasi-classical 
case, it is more convenient to start not from the final formula (44.3) of perturbation 
theory but from a formula in which the integration with respect to time has not yet 
been carried out. For the total (over all time) differential probability we have§ 


dw = ps | a,i| Qry” afi = J V;i(t) dt (90.7) 


+ The full solution of the quantum problem of synchrotron radiation was first given by N. P. 
Klepikov (1954), and the first quantum correction to the classical formula by A. A. Sokolov, N. P. 
Klepikov and I. M. Ternov (1952). The derivation given here, which explicitly makes use of the fact that 
the motion is quasi-classical, is due to V. N. Baier and V. M. Katkov (1967). A similar method had been 
used earlier by J. Schwinger (1954) to derive the first quantum correction in the radiation intensity. 

+ In this section, unlike Chapter IV, the generalized momentum is denoted by the capital letter P, 
while p denotes the ordinary (kinetic) momentum. 

§ Putting Vs (t) = Vg exp(iwsit), we find as = 27V 46 (wsi), and, since the squared delta function is to be 
taken as [5(w)]° > (t/27r)5(w), where t is the total observation time (cf. the derivation of (64.5)), we obtain 
from (90.7) the formula (44.3) for the probability per unit time. 
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(cf. QM, (41.2)); the summation 1s over final states of the electron. 
Using (90.6), we can write the matrix element V;(t) for emission of a photon a, 
k in the operator form 


_ ev (4m) x ,(@i/nHt U (p) iot—ik-r¢o% , U(p) Gin ft 
Vi) = ~ 7h) lorem gee Mao 


where the operators in the square brackets act to the left; the photon field is taken 
in the three-dimensionally transverse gauge. The factors exp(+iHt/h) convert the 
Schrodinger operators between them into explicitly time-dependent operators of 
the Heisenberg representation. We can write V;;(t) in the form 


Valt) = YD GiQQnliy eX 


where Q(t) denotes the Heisenberg operator 


O(t) = uj(B) ( *)e ~ik - F(t) Up) (90.8) 
V (2H) a VQH) 


and the matrix element is taken with respect to the functions dy, qi. 
The summation in (90.7) is taken over all final wave functions ¢;, and is effected 
by means of the equation 


2 b¥(r')b;(r) = S(r’ — 1), 


which expresses the completeness of the set of functions ¢;. The result is 


3 
dw = < ‘s at, | dts - ei 1Q*(4)Q(tyli). (90.9) 


If the integration is over a sufficiently long time interval, t; and t,. can be replaced 
by new variables 
T=h-h, t=2(ti+h), 


and in the integral over t the integrand may be regarded as the probability of 
emission per unit time. Multiplying by iw, we obtain the intensity 


2 
dl = £5 d?k | e-*HQ*(t +42) Q(t — 42) |i) dr. (90.10) 


An ultra-relativistic electron radiates into a narrow cone at angles @~ml/e 
relative to its velocity v. The emission in a given direction n = k/w therefore occurs 
over a section of the path in which v turns through an angle ~m/e. This section is 
traversed in a time 7 such that 7|v¥| ~ tw) ~ m/e <1. This region gives the principal 
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contribution to the integral over 7+. In the subsequent calculations, we shall 
therefore expand all quantities in powers of wot. It may, however, be necessary to 
retain more than just the leading term in the expansion, because of cancellations 
which occur since 1—n- vy ~ 6? ~ (m/e)’. 

If the operator Q*Q is reduced to a product of operators which commute (to 
the necessary degree of accuracy), the taking of the diagonal matrix element 
(i|...|i) is equivalent to replacing these operators by the classical (time-dependent) 
values of the corresponding quantities. This is achieved in the following way. 

According to the foregoing discussion, in the expression for Q(t) only the 
non-commutativity of the electron operators with the photon field operator 
exp(— ik: F(t)) need be taken into account. We have 

Pp ger = e* 5 a hk), 
ae. ae (90.11) 
H(p)e “*'=e “''H(p— hk). 


These formulae follow because e ‘“’' is the displacement operator in momentum 


space. Using (90.11), we can take the operator e“’"? out on the left in 
(90.8), and write Q(t) in the form 
On=e* RD, R= PD (a. er) HO) (90.12) 
V(2H’') V (2H) 
where H'= H — ho, p' = p— hk. 
Then 
030: =Rre* Pe“ AR, (90.13) 


here and henceforward, the suffixes 1 and 2 denote the values of quantities at the 
times t)=t—347 and t.=t+4r. It remains to calculate the product of the two 
non-commuting operators e'“’” and e “""!, This product itself may be regarded as 
commuting with the remaining factors. 

We write 


L(r) = e #7 eff pie tr, (90.14) 


this being the combination of operators which appears in (90.10). The operator e”* 


is a time-shift operator, and so 
eik- fo pifltih pik-#, 9—iArin 


Substituting this in (90.14) and noting that e'“’"" is a displacement operator in 
momentum space, we find 


L(+) = exp{i[H — hw]z/h} exp{—iH (p, — hk)7/h}. (90.15) 


Differentiating (90.15) with respect to 7 and again using the properties of the 
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time-shift operator, we havet 
dL.|dr = (i/h) exp{i(H — hw)/h}[H — ho — H(p— hk)] x 
x exp{—iH (p1 — hk)7/h} = (i/) LH — ho — A (pf) — hk) JL (7). (90.16) 


Having thus made use of the non-commutativity of the operators, we can 
replace all the operators by the corresponding classical quantities (the Hamiltonian 
H by the electron energy ¢). We have identically 


€(p2— hk) = [(p2— hk)? + m7)" 
=[(e —hw) + 2h(we —k > p»)]!”. 
The difference 
we —k+ po = we(1—n- v2) 


is small, since from the above analysis 1 vy: n~(m/e)’. As far as the first order in 
this difference, 


e(p2— hk) = e+ (e/e')h(w@ —k + v2), 
where ¢’ = ¢ — hw. From (90.16), we now find the differential equation for L(r): 
indL|dt = (e/e')\h(w —k- v2) L. (90.17) 


This equation is to be solved with the obvious initial condition L(Q) = 1. Since 


vy. dt =Mm—N'1, 


we have 
L(r) = exp{i(e/e')(k + r2—_k +r — w7)}. (90.18) 
So far, no use has been made of the specific form of the electron trajectory. 


Now expressing r,—r, in (90.18) in terms of p; by means of the equation of motion 
of the electron in the plane perpendicular to the field H (see Fields, §21): 


pi . eH Bar| 7 | 
m—-r,= oH 52 - —— + oH? 1 cos ~~], 


and expanding in powers of 7 gives 


: 2n72 
k(n) or ~ orf(n- vi +7 2 PAE 2 e°H \, 


where in the last term we have put n- vy, = 1. 


+ Because of the conservation of energy, the Heisenberg operators H (p1) and H (p2) are the same; we 
therefore omit the arguments of H in such cases. However, H (pi— hk) is of course not the same as 
H (po — hk). 
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We next transform the remaining factors in (90.13). A direct expansion of the 
product in R(t), using the matrix a from (21.20), leads to 


R(t) = wfe* -(A+iB X o)wi, 


nx Sf I =)-So 

A=b(l +5 = per (90.20) 
_if/ PP) ho 

B (—P —P_) ~ 58 -(n—v+vm/e), 


where p'(t) = p(t) 4k; terms of higher order in m/e are omitted. Thus we have 
finally 


e~"(i1Q3Q,|i) = RERiL(1), 


1 , ‘ (90.21) 
RR, = trx14+; + o)(A2— iB, X o)- e+ 3(14+ G - (A, + iB, X o) - e*. 
The factors 3(1+¢+o) are two-rowed polarization density matrices of the initial 
and final electron. 
Let us consider the radiation intensity summed over the polarizations of the 
photon and of the final electron, and averaged over the polarizations of the initial 
electron. These operations give, after a simple calculation,t 


1 24 12 h 2 
5 RIR =e wie +5 5 (Be) (BY. 


polar. 


With sufficient accuracy we can put 


<1 De 12 « 
VOW = VW Kaw tare ¥ 


wor’. 


jm 
ee 


2 


Substitution of these expressions in (90.21) and thence in (90.10) gives 


2 


dI = - 7-4 0? do dog X 


[e.e) 


m> ste IWTE 
x | (% +o ES wir’) exp{- se (1-n- vite od) bdr (90.22) 


E 


—e 


+ This calculation makes use also of the following result. In the summation over e, 
> (v, - e)(v2° e*) = vi - v2 — (vi M)(V2 * n). 


On substituting (90.21) in (90.10), we can integrate by parts, noting that 


ole Me (-ik-n1) 
w dt; P €! i 


and similarly for v2: n. Consequently, in the remaining integration v;:n and v2-n can be replaced by 
unity. 


(v:°n) exp(- i =k . ri) 


QE4-2 


382 Interaction of Electrons with Photons §90 


This formula shows the frequency and angular distribution of the radiation in- 
tensity. 

To find the frequency distribution, we integrate over do,. If the direction of v is 
taken as the polar axis, with an angle 0 between n and v, then 


n:-v=v C08 9, do, = sin 3 dd dd, 


and 


[ex {ere nv Pi as 27rE' {ex (2) -ex (- ere) 
Pye "  LWOTEV Pte P eg’ J} 


When this is substituted in (90.22), only the first term need be retained, since the 
second term yields a faster varying exponential (with a factor 1+ v ~ 2 instead of the 
small 1— v ~ m”/2e”). Hence 


dI _ ie*w m> ¢?+¢e” { iwte ; 
do 7 Sa | (igh aeen 7) em“ (1-0 + Fye8) f ae 


According to the integral representation of the Airy function ® (see QM, 8b), the 
first term reduces to the integral of the Airy function, and the second term to its 
derivative. The final result is 


aT ae i o(8) dé+ (E+ "2 vx oc}, (90.23) 


x = (hol e'y)? = (m7/e”)(ew/ €' wo)” (90.24) 


(A. I. Nikishov and V. I. Ritus, 1967). The frequency distribution has a maximum 
when x ~ 1; for yx <1, we find (90.1), and for y >1 (90.4). In the classical limit, 
ho Ke'~e,x= (w/w)(m/e): the second term in the round brackets is small, and 
(90.23) becomes the classical formula (Fields, (74.13)). 

Figure 15 shows diagrams of the frequency distribution for various values of x. 
The quantity 


1 dI 
31/2 d(w/ we) 
is plotted against w/w,, where 
ss SE = Oe m’y? _ 2e*H*e° 
ha, Phy’ I, 3h2 3m‘ . 


The quantity I, is the classical value of the total radiation intensity; cf. Fields, 
(74.2). 
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To calculate the total radiation intensity, (90.23) must be integrated with respect 
to w from 0 to e. We change to integration with respect to x, noting that 


i 
hw = «(1-70 3m). 


and x therefore varies from 0 to ~. With two integrations by parts in the first term 
in (90.23), we find 


— e’m*y’? A+ Syx??-+ 4y2x? 
[= WV rh? aa ee ee dx. (90.25) 
0 


TT 


Figure 16 shows a graph of the function I(y)/Ig. 
When y <1, the important region in the integral is x~1. Expanding the 
integrand in powers of y and integrating by means of the formula 


i x’®'(x) dx =— we 3¢- DT Ey + IT Gv +3), 


0 


we obtain 


I = 1,(1-2593 y +48y?—-- ), (90.26) 


When y > 1, the important region is that in which yx*” ~ 1, i.e., x <1. In the first 
approximation, we can therefore replace ®'(x) by ®'(0) = —3'"T@/2V 7z, and the 
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integration then leads to the result 


321 (e?m? 
[ae OO 


= 0.37 


em? He 2/3 
h? Gey 


§90 


(90.27) 


Synchrotron emission causes the occurrence of a polarization of electrons 
moving in the field (A. A. Sokolov and I. M. Ternov, 1963). To discuss this, we 


have to find the probability of a radiative transition with spin reversal. 


Putting in (90.21) G =—-G =6, |¢| = 1, we have 


R3R, = By - Bz — (e* - Bi)(e + Bz) — (e* + B, x O)(e - B, x f) — i(€ - e*)(e « B, X By). 


Summation over polarizations of the photon gives, after a simple calculation, 


> R#R, = (B,- BU -(C- n)’) + (€-n)(n- B,)(E - By) + 


+ (C+ n)(n - By) - Bi) — i(¢— n(n - €)) - B, X Bo. 


(90.28) 


We shall assume that y <1 and seek only the principal term in the expansion of 
the probability in powers of h. Since the expression (90.28) (with B given by (90.20)) 
already contains h’, all the remaining quantities ¢’, including those in the exponent 


in (90.18), can be replaced by e. 
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With the expansions 


m 
Bi =< (n—v+irv+v™), 
2€ € 


and substituting (90.28) in (90.21) and thence in (90.10), we find the differential 
transition probability per unit time (dw = dI/hw). The integration over d’k is carried 
out by means of the formula 


Rice ye aks — f(id,) Sere (90.29) 


where in this case 


The result of the calculation is 


SY il Made big 
am \m/) @°F 04+22/12)% (2% 122? 


where z = twoe/m and the contour of integration passes below the real axis and is 
closed in the lower half-plane. After this integration we finally obtain for the total 
probability of a radiative transition with spin reversal 


a) v)- 


w= 


5V3a h* (e\> 3/, 24. 8V3 
= nt (=) oi( j= qe} (90.30) 


Where (;,=C€-v, €¢,=¢€-H/H. This formula is valid for both electrons (e <0) and 
positrons (e > 0). 

The probability (90.30) is independent of the sign of the longitudinal polarization 
¢, but depends on that of ¢,. The polarization resulting from the emission is 
therefore transverse.t For electrons, the probability of a transition from a state 
with the spin parallel to the field (¢, = 1) to a state with the spin antiparallel to the 
field is greater than that of the opposite transition. The radiative polarization of the 
electrons is therefore antiparallel to the field, and the degree of polarization in a 


+ This is also evident from the fact that the axial vector of the resultant polarization must be along 
H, which is the only axial vector occurring in the problem. 
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Stationary state is (when ¢ = 0) 


w(f,=—1N—w(Z,=1)_ 8V3_ 
"Sa Dawa)” 1s 


Positrons are polarized, to the same degree, parallel to the field. 


§91. Pair production by a photon in a magnetic field 


The production of an electron—positron pair by a photon in a magnetic field, and 
synchrotron radiation, are two cross-channels of the same reaction. The amplitude 
M;, of the pair production process is therefore found from the synchrotron 
radiation amplitude by simply making the changes 


£, p> — 4, — ps3 e',p'> €_, p-; w, kK—> — w,— k. (91.1) 


Here e_, p- and «,, p, are the electron and positron energies and momenta; «, p and 
e', p’ the initial and final energies and momenta of the electron in synchrotron 
radiation. In terms of angles and magnitudes, the momenta are transformed 
according to 


lp|>|p-|, |[p'|>lp-]|, 927-6, 0'>6-, d>-¢d-T7, (91.2) 


where @. are the angles between p+ and k, ¢ the angle between the kp, and kp_ 
planes. 

For synchrotron radiation, the cross-section is given in terms of the amplitude 
byt 


d°p'd’k. 


(27r) ’ 


U 


—w) (91.3) 


1 
= |? aoe = 
do = |M;i| 8 plea d(e—e 


see (64.25). The delta function is eliminated by integrating with respect to «’. Since, 
in the present case, p’ and k are independent variables, and 


d°p'=|p'e’de'do', dk =w’ dwdoy, 
we have simply to substitute 
5(e —e'—w) d’p'd’k >a'|pe' do, do’ dw. 


Then 


da = |M,i|’ do, do dw. (91.4) 


ay) , 
8(277)|p| 


+ In this section we again put f = 1 as well as c = 1. 
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For pair production by a photon, the cross-section is given in terms of the 
amplitude by 


d*p. d°p_ 
8(w — 8-2) Re 


_ing 2 
do = |My Swe_e4 


or, after elimination of the delta function, 


at 12 \p+||p-| 
do = |Mji| 8(2ar)Se do, do_ dé,. (91.5) 


Comparison with (91.4) shows that, to obtain the pair production cross-section 
from the synchrotron radiation cross-section, we have to make in (91.4) the 
changes (91.1), multiply by 


(pi/w’) de,/do, (91.6) 


and replace do' do, by do, do_. 

In the ultra-relativistic case w > m,f this can be done in the formulae derived in 
$90. Here it is assumed that both particles in the pair are ultra-relativistic; it is 
easily verified that all the approximations used in §90 then remain valid. 

In particular, the probability of pair production by an unpolarized photon, 
summed over the electron and positron spin projections and integrated over the 
directions of emergence of the electron, is found on making the changes (91.1) in 
(90.22) (or rather in the expression for dI/dw), with d°k = w’ dw do, replaced by 
d*ps: 


LIWTE + ( T 


2 
x exp| ret I-n-v. +37 03.)} dt. (91.7) 


where wo, = |e|H/e+, and n is a unit vector parallel to the photon momentum, which 
lies in the plane perpendicular to the magnetic field. The integration is carried out in 
the same way as in §90, and (since (91.7) depends only on the angle between n and 
v.) it does not matter whether we integrate over do, or over do,. The result can 
therefore be obtained directly by analogy with (90.23): 


T 


ie ne (f b(é) dé + (=- «Vx) oe} (91.8) 


+ More precisely, we must have w sin ¢ > m, where 9 is the angle between k and H; when 6 = 0, no 
pairs are formed. In what follows, we shall take & = 377. 
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where now 


= (m3a/le|He.e), 
(91.9) 


k = |e|Ha/m? (= h’|e|Haw/m’*c’). 


The total pair production probability per unit time is found by integrating (91.8) 
with respect to €4; in view of the obvious symmetry in e, and ¢_=w— €4, it 1s 
sufficient to take twice the integral from 0 to 3. Changing variables from e4 to x and 
integrating by parts in the first term in (91.8), we obtain 


3H 3/2 4 1/2 3 3/2 __ ”) @’ 
| (ee P(x) — Ss aye | dx (91.10) 


~ mk 
(4/«)23 


(A. I. Nikishov and V. I. Ritus, 1967). 

In the limit of weak fields (x <1), values of x near the lower limit are important 
in the integral (91.10). Since these values are large, we can use the asymptotic 
expression for the Airy function, 


1 
P(x) ~ 51 exp(—3x"); 


see QM, $b. With the variable of integration y =x**—4/kx, and putting y =0 
wherever possible, we find by calculation 


3/2) _,|3 
w = EOF oa. K<1., (91.11) 


9 2m 


The exponential decrease of the probability as x—0 corresponds to the im- 
possibility of pair production in the classical limit. 

In the opposite limit of strong fields (x > 1), only the second term in (91.10) is 
important, and it is governed by the range of x for which x*”~ 1/« <1. In that 
range, ®'(x) may be replaced by ®'(0) = — 3'"T'G)/272. With the value of the integral 


io.) 


[ro - De dy =Te - wW)ITO, 


1 
we find 


316 x 5p23 eH 


The function mw (k)/|e|>H has a maximum value of 0.11 at k =~ 11. 


§92 Electron—Nucleus Bremsstrahlung 389 


$92. Electron—nucleus bremsstrahlung. The non-relativistic case 


This section and those following are concerned with the important phenomenon 
of bremsstrahlung, the radiation emitted in a collision between particles. We shall 
first consider a non-relativistic collision between an electron and a nucleus, 
assuming that the nucleus remains at rest; that is, we consider radiation from the 
scattering of an electron in the Coulomb field of a fixed centre (A. Sommerfeld, 
1931). 

We begin from formula (45.5) for the probability of dipole radiation: 


dw =(w*/27)le* + dj|’ doy. (92.1) 


In the present case, the initial and final states of the electron belong to the 
continuous spectrum, and the photon frequency 


w = (1/2m)(p* — p”), (92.2) 


where p= mv and p’= mv are the initial and final momenta of the electron. If the 
initial and final wave functions of the electron are normalized to “‘one particle per 
unit volume’”’ (V = 1) the expression (92.1), on multiplication by d’p'/(27)*> and 
division by the incident flux density v/V = v, will give the cross-section doy, for 
emission of a photon k into the solid angle do, with scattering of the electron into 
the range of states d*p'. Replacing the matrix element of the dipole moment d= er 
by that of the momentum: 


we can write the expression for the cross-section in the formt 
we? 2 3 
_ * op, 23 
AS kp (27r)'mp le* + pyi|’ do, d’p’, (92.3) 
where 
Pri = | vibu d°x =-i | vIVI ds, 
For yw; and W we must use the exact wave functions in an attractive Coulomb field, 
whose asymptotic form consists of a plane wave and a spherical wave. The 


spherical wave must be ingoing in yu, and outgoing in y (see QM, $136). These 
functions are 


= A; e”® 'F(iv, 1,i(pr—p-r)), v = Ze*mlp; 
op (iv, 1, 1(pr—p-r)) 92.4) 


Uy = Aye’? 'F(-iv',1,—-i(p'’r+p'-r)), v' = Ze’m/p', 


+ In this section, p and p' denote |p| and |p'| respectively. 
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with the normalization factors 
A; = e™”’*T(1 — iv), Ay = e™ PT + iv’). (92.5) 
Since 


VF (iv, 1, i(pr— p-r)) = i(pr/r — p)F' 


we can write the gradient Vu; as 


On multiplication by w¥ and integration, the first term vanishes, because wy and Uy 
are orthogonal. The matrix element p,; is therefore 


Pi = LA;AsgpaJ/ 0p, (92.6) 


where J denotes the integral 


—iq:r 
s=[S F iv’, 1, i( Sea 'or))F lV, l, i( r—-p°r d°x, 
ma le a i (92.7) 


q=p —p. 
The symbol @/dp has been taken outside the integral, with the understanding that, in 
the differentiation of J, the quantities v, v', q are to be regarded as independent 
parameters, v and q being expressed in terms of p only after the differentiation. 
The integral is calculated by replacing the confluent hypergeometric functions 
by their expressions as contour integrals. Here we shall give only the result: 


J = BF (iv’, iv, 1, z) 
B = An eo7™(— q os 2q 5 p) "(q? _ 2q . py "qn, (92.8) 


7 = 7 Lipp’ +p: p')—2(q- pq: p’) 
(q'-2q°p'\(q'+2q-p) - 


Here F(iv’, iv, 1, z) is the complete hypergeometric function. 
After differentiating in (92.6), we can put q = p’— p; then 


—2PP—P Pp =(p- pz) (92.9) 


2" (p=py 


+The calculations are given by A. Nordsieck, Physical Review 93, 785, 1954. 
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(z <0). Also, 
—q’—2q°p=q'~ 2q: p'=p’—p">0. 


The matrix element is thus finally found to be 


Sari e™ (2 a ee 
POA es in x 
Pf ‘(p—p'(p+p'’) \p—p 
x (1-2) Lipp qF (z) + (1 — z)F'(z)(p'p— pp’), (92.10) 
where we have put for brevity 
F(z) = F(iv’, tv, 1, z). (92.11) 


The cross-section is obtained by substituting (92.10) in (92.3), but the general 
formula is very lengthy and obscure. We shall therefore go on immediately to 
calculate the spectral distribution of the radiation, i.e. to integrate the cross-section 
over the directions of the photon and the final electron. 

The integration over do, and the summation over the polarizations of the photon 
are equivalent to averaging over all directions e and multiplication by 2 X 47, Le. to 
the substitution 


ee; do, =? (8 17/3) dix. 
The cross-section is then 


d°p’ 


4we? | He 
Pi Qcry 


doy, = or 


Dost 
= eae Ipyi|? dw doy. (92.12) 


The value of |p,i|’ is calculated by using (92.9)-(92.11) and the formula 
[PC — iv)? = av/sinh av. 
The result is 


Ip P = 32m(Ze*)’m? 

i] ye. Ne IN a 2a! aN 
M"  p(p +pY(p — p' (1 - e*" (e?” — 1) 
z 
1-z 


x = JF? —z|F'P+3i(@v t+ v’) 


(FF'* — F*P)}. (92.13) 


To integrate the cross-section (92.12) over do, =27 sin 0 d0, we change from 
the variable & (the scattering angle) to 


2pp' 7(p —p’y 
=——FF J(1-cos3), doy>—*—*~ dz. 
5 (p— py! sa Op pp 
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In order to integrate with respect to z, we transform the expression in the braces in 
(92.13) as follows. According to the differential equation of the hypergeometric 
function (see QM, (e.2)), we have 


z1-—z)F"+|1-C+t+iv+iv'))z|F'+ vv'F = 0, 
z1-—z)F"*+[1—( — iv — iv’)z]F'™* + vv' F* = 0. 


Multiplying these equations by F* and F respectively and adding, we obtain 


ca 2)| + 2(F'F* + FF) —22|F’? + te 


Cs ee oi au  \FP|=0. 
Hence the expression in the braces in (92.13) is seen to be 


pale i a 2(F'F* + FF’*), (92.14) 


and the integration is immediate. 
Collecting the above formulae, we find as the final expression for the brems- 
strahlung emission cross-section in the frequency range dwf 


_ 647° 22 mc? p! p} dw 
do. Cart OF p mee aD (ag ON) Sy 02.19) 


where 


vy = Zamclp = Ze?/hv, v' = Ze’lhv', p' = V(p*—2mho), 
F(€) = F(iv’, iv, 1, ), E = —App'l(p — p'y’. 
Let us consider the limiting case where both velocities v and v’ are so large that 


vy <1, v’<1 (but, of course, still with v <1, so that Za < v <1; this is possible only 
if Z is small). To calculate the derivative F’(€) in this case, we use the formula 


a _ 2B 
i Fle By 2)=2" Fa +1, B+l,y41,2), 


which is easily obtained by simple differentiation of the hypergeometric series. 
Then 


F'(€) ~ iv - iv'F(,, 1, 2, €) 
= (vv'/E) log — ); 


the last equation is evident from a direct comparison of the corresponding series. 


+ Formulae (92.15)-(92.25) are given in ordinary units. 
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For the function F(é) itself, we have simply 
F(é) ~ FO, 0, 1, €) = 


Then, from (92.15), 


2c vtv' dw 
ye = ww. (92.16) 


Ze*/hv <1, Ze*/hv' <1. 

The smallness of v and v’ is just the condition for the Born approximation to be 
valid in the case of Coulomb interaction. Formula (92.16) itself can therefore be 
more simply obtained directly by means of perturbation theory (see Problem 1). 

Now let a fast electron (vy <1) lose a considerable fraction of its energy by 
radiation, so that v’<v and v’ may not be small. Then 

-—&=4p'/p =4p/v' <1, 
F(€) = F(iv’, 0, 1, €) = 1, 
F'(é) = — vp’ F(1 + iv’, 1,2, 2) > — ww’, 


and the cross-section 1s 
647 3.9 n(S) 1 dw 
doy = 30 a v) 1—exp[—27Ze’/hv'] w’ 
Ze7lhv <1, Ze*lhv' = 1. 


(92.17) 


When v’ <1, this formula yields the same limiting expression, 
do, =% Zarz ©, 


as (92.16) does when v’<v. Hence formulae (92.16) and (92.17) jointly cover the 
whole range of v’ (when v <1). 

When w > wo, where hw = 4mv’, the velocity v'>0 and v’>~. In this limiting 
case, (92.17) gives 


3 
fis ae Za? n(<) ee (92.18) 


Thus do,/dw tends to a finite limit as w > wo. This can be explained in a general 
manner by arguments similar to those given in QM, 8147. The physical reason is 
that the frequency wo is the limit only of the continuous bremsstrahlung spectrum. 
The electron can also go to a bound state with emission of a frequency w > wo. But 
highly excited bound states in a Coulomb field have properties almost the same as 
those of the free states near their limit. Hence the boundary between the con- 
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tinuous and the discrete spectrum is not essentially a physically distinctive point. 

Let us now consider the case where both parameters pv, v’>1. The motion of 
both the initial and the final electrons is then quasi-classical. If the condition 
hw <p*/2m is also satisfied, the matrix element too is quasi-classical. Then the 
formula of quantum mechanics must become the result given by the classical 
theory (see Fields, §70). We shall, however, suppose that p’/2m ~ hw, so that we 
need an asymptotic expression for the function F(é) when v, v'>© and é~1;a 
more exact condition will be stated below, (92.24). 

To derive this expression, we start from the integral representation of the 
hypergeometric function, QM, (e.3), writing it as 


—Tpv' 


e 
2Ti 


F(ipv’, iv’, 1, €) = ? cl ~ te" — téy dt’, (92.19) 
a 


where 
p=ovlv', O<p<1, 
so that 
E=—4p/(1— py’. (92.20) 


The contour of integration is taken as shown in Fig. 17, passing along part of the 
real axis and avoiding the points t = 0 and t = 1.t 

When vp, v’>1, the value of the integrand is small on the lower part of this 
contour, and may be neglected. On passing downwards round the point t = 0, the 
integrand is multiplied by the small factor exp(—27pv’'), and on passing upwards 
round t = 1 it is multiplied by exp(27pv’). The integral 


er ge) = t? 
F a atit, f(t) = ilog gaa 


(92.21) 


may be calculated by the saddle-point method. The saddle point to is given by the 
condition f'(to) = 0, whence ty = 3(1— p). At this point, however, the derivative f’(to) 
is also zero, so that we must write 


f(t) ~ f(to) +3iar’, T=1—{), 


t=O t =| 
Fia. 17. 


+ For the hypergeometric function F(a, B, y, €), the contour is to be chosen so that the function 


V(t) es ete rt a 1)'* 


returns to its initial value on passing round the contour. When y is integral (here, y = 1), the contour 
chosen satisfies this condition. 
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where 


F 1 1 
f (to) = 2mp + (1+ p)logs—*, a =5- f"(to) = 


= 16 
1+ p’ 1— 


a a 

(1 — p’)’ 

The coefficient 1/t of the exponential in the integrand may be written 
1/t ~ 1/to — 1/t6; 


we cannot here take simply the term 1/to, as this would reduce to zero the 
derivative d|F(é)|?/dé in (92.15). Thus we have, after an obvious substitution in the 
integrals, 


l ' , 
F ~ Saritlav 3 CXPt- TAY + v'f (to)} X 


ioe) 


x {- | ei? dx + avi? | xe? act. (92.22) 


00 


The two integrals here are, respectively, 


loa) 


1.3 i 21 
2 | eos dx = PEP 
2 | x sindx? dx = 3"%Q), 
0 
The derivative F’(&) is calculated similarly; according to (92.19), it is given by an 


integral which differs from (92.21) only in that the coefficient 1/it of the exponential 
is replaced by v'/(1 — &t). A simple calculation then leads to the result 


ies piel 


Finally, substitution of this in (92.15) gives, with the necessary accuracy, the 
simple expression 


16 mc’? d 
do, = aur Zar? ar ae (92.23) 


The condition for this to be valid, i.e. for the asymptotic formula (92.22) to be valid, 
is that the second term in (92.22) should be much less than the first: (1 — p)v > 1, or, 
expressing the parameters of the hypergeometric function in terms of physical 
quantities, 


hw > (hv/ Ze’) -4mv’. (92.24) 


396 Interaction of Electrons with Photons §92 


This inequality is compatible with the quasi-classicality condition ha <jmv’, i.e. 
1—p <1. When the latter condition is satisfied, the result must be the same as the 
corresponding formula in the classical theory, since on multiplication by hw the 
expression (92.23) becomes the classical formula (Fields, (70.22)) for the “effective 
retardation” in the high-frequency limit.t In order to go to the classical formulae 
throughout the range (l—p)v ~1, v>1, we should have to find the asymptotic 
form of the hypergeometric function when the saddle point is close to the 
singularity t = 0; we shall not deal with this here, since the final result is obvious. 

All the formulae given above refer to an attractive Coulomb field. The cross- 
section for emission in a repulsive field is obtained from (92.15) by changing the 
signs of v and v’. Then, in particular, the limiting Born formula (92.16) remains 
unaltered, but in the limit v <1, v' ~~ we have instead of (92.18) 


_ 1287; n(£) (—Vfomednear) hide 
do, = 3 Zar. >) &XP Gone) Serre (92.25) 


i.e. the differential cross-section tends exponentially to zero as w > wo. This result 
also is reasonable: in a repulsive field, there are no bound states, and the frequency 
wo is the true boundary of the radiation spectrum. 


PROBLEMS 


PROBLEM I. In the Born approximation, find the bremsstrahlung cross-section for a non-relativistic 
collision of two particles having different values of the ratio e/m. 


SOLUTION. The dipole moment of two particles with charges e1, e2 and masses mj, m2, in their 
centre-of-mass system, is 


where p = mim2/(m,+ m2), r=r;i—1r2. Hence 


j= (2 - =) pit =- (4-2 yo 
mi m2 my, my? r 
The matrix element is 
eee 1 Fy Be? aan 
p'p — att )e'p> w =(p p’)/ Bb, 


where p= LV, p’ = pv’ are the momenta of relative motion, and it is calculated from the plane wavest 


p=e'?™, by = eh F 


by means of the formula 


+ The agreement of the formula (92.23) for hw do. with the classical formula, when the one condition 
(92.24) is satisfied, is to some extent accidental. In the classical formula, the difference between v and v’ is 
beyond the accuracy postulated, and there is no réason to identify vo in Fields, (70.22) with v specifically; if 
vo is identified with v’, there is no longer agreement with (92.23). 

+ The replacement of two particles by a single particle having the reduced mass is, of course, 
permissible only in the non-relativistic case. 
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The result is 
2,2 2d 2 
(2 2102 (er VM Oo ew. g) A aa, 
doxp os (2 <2) ; a (e+ q)(e* : q) = doy dox. 
After summation over polarizations, the angular distribution of the radiation is given by a factor 
sin’ ®, where © is the angle between the direction of the photon k and the vector q, which lies in the 
scattering plane (see (45.4a)). 
After integration over the directions of the photon we have 


6 22/1 €2 2 v' dw sin 6 dé 
dows = 3e1e2 Se ay eae) ee Fi A > 
mi mM2/ vw vit+tv'~—2vv' cos @ 


where @ is the scattering angle. Finally, integration with respect to @ gives 


2 

1 22/1 e@2\ 1 v+v' dw 
do. = 3e1e2| — — — } —log _—, 
v v—v' w 


mm, mo 


For radiation in the field of a fixed centre of Coulomb force, this formula is equivalent to (92.16). 


PROBLEM 2. In the Born approximation, find the bremsstrahlung cross-section for a non-relativistic 
collision of two electrons.t 


SOLUTION. In this case there is no dipole radiation, and we must therefore consider quadrupole 
radiation. In the classical theory, the spectral distribution of the total intensity of quadrupole radiation is 
given by 


me 96|(Dik) ool”, 


where Dy = = e(3xix, — 15%) is the quadrupole moment tensor of a system of charges.t For two 
electrons we have, in their centre-of-mass system, 


1 2 
Dix = 2€(BXiXk — Six), r=r)}—fr. 


In the quantum theory, the Fourier components must be replaced by the matrix elements (cf. the 
discussion of dipole radiation in §45), and, with appropriate normalization of the wave functions (plane 
waves) and division by the photon energy w, we obtain the cross-section for emission of radiation with 
scattering of the electrons into the range of states d’p 


doy = a5 (Buool? = it coe 


where v = 2p/m is the initial velocity of relative motion; the emitted frequency w = (p?—p”)/m. 
The operator Dx is calculated by threefold commutation of the operator Dx with the Hamiltonian 


> 
Ne 


H= 


3 [> 


é€ 
she 


and is§ 


aA 


ie 2e° Xi MEX x. NXE agate 
Bu == | 6(% bs + be) +6(%5 6 +B: *5) - 9( — + pH) bu (5 6+ 615), 


+ The collision velocity v satisfies the conditions a <e’/hv <1. The classical case (e’/fv > 1) is 
discussed in Fields, §71, Problem. 

+t This formula is obtained from Fields (71.5) in the same way as (67.11) in that book is derived from 
(67.8). 

§ This expression is analogous to the classical form 


.  4e? 1 
Bu = 75 |6% ps. +63 pi 9S p r— Bap: r, 


which would be obtained on differentiating Dx and using the classical equation of motion: 
2mé = e'r|r’. 


QE4-AA 
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Since the two particles (electrons) are identical, the matrix elements are calculated from the wave 
functions 


1 ip-r —iper 1 ip’-r -ip':r 
Wp = 75 (e” aa’? ), Wy = 75 (e* aoe ees ), 


where the signs + and — correspond to total spins 0 and 1 of the electrons (interchange of the electrons 
corresponds to changing the sign of r). 
The lengthy calculations lead to the following formula for the spectral distribution of the radiation: 


4 9f,, 3x? 12Q2-—x)*-72—x)’x?-3x* ss 1) VI =- x) 
dow = tari 17 (2—xy* (2—x) (1—x) cosh ops dx, 


where x = w/e and e = p’/m is the initial energy of relative motion of the electrons; the cross-section is 


averaged over values of the total spin of the electrons. The cross-section for energy loss by radiation is 


E 


t | doe =8.1ar! 
ro 


(B. K. Fedyushin, 1952). 


PROBLEM 3. Determine the energy of the radiation resulting from the emission by a nucleus of a 
non-relativistic electron in the s state. 


SOLUTION. The wave function of the emitted electron is an outgoing spherical s wave normalized 
to unit total flux: 


1 e ipr 


"SV bm) 7 


see QM, (33.14). As the wave function of the final state of the electron (after emission of the photon), 
we choose the plane wave 


wae 


The transition matrix element ts 
; * 
Pri = (pis)* = (| wp tpuy d x) 


= p’ | pci -rtipr 0X 
V (4arv) r 


U 


47 p 
vp -p 
TV 
vw’ 


the integral is calculated by means of (57.6a). The radiation energy is given by (45.8) on multiplying by 
d*p'|Qz) and integrating over the directions of p’ (which is equivalent to multiplying by 47). The 
spectral distribution of the emitted energy is then 

dE, = (2e’v"/37v) du. 


When w -> 0, the final velocity v’ of the electron tends to v, and the formula agrees, as it should, with the 
non-relativistic limit of the classical result; see Fields, §69, Problem. The total emitted energy is (in 


ordinary units) 
4 v\ 
a= 15a (=) = 


where & =2mv’ is the initial energy of the electron. 
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PROBLEM 4. Determine the energy of the radiation resulting from the reflection of a non-relativistic 
electron from an infinitely high potential barrier. 


SOLUTION. Let the electron be moving perpendicularly to the barrier. Although the photon may be 
emitted in any direction, in the non-relativistic case the photon momentum is small compared with the 
electron momentum, and we may therefore suppose that the reflected electron also is moving per- 
pendicularly to the plane of the barrier. Let the barrier be at x = 0, and the electron be moving on the 
side where x >0. The wave functions of the stationary states of one-dimensional motion, normalized by 
5(p/27r) (p = px), have the form of stationary waves (see QM, §21): 


Wi = 2 sin px, wy = 2 sin p’x. 


The matrix element of the operator p = px is 


oo 


Pfi = ~4i | sin p’x 4 sin px dx 
0 


integrals of this form are to be understood as the limit, as 6 > +0, of the values obtained by including a 
factor e ™ in the integrand. 

The energy radiated in a single reflection of the electron is found from (45.8) by multiplying by 
dp'= dw/v' and dividing by v/27 (the flux density of the wave approaching the barrier in the initial 
function i): 


_407e?, 2 2ardw 
dE. = ,7 [Pail vo! 


_ 8 2! 
=3 eww dw. (1) 


At low frequencies (w <e = 3mv’) we have v’ ~ v, and formula (1) becomes the classical formula (Fields 
(69.5)), which has to be integrated over angles, using the fact that v =3Av, where Av is the change in 
velocity of the electron on reflection; this is as it should be, since the condition for the collision time to 
be small (Fields, (69.1)) is always satisfied in reflection from a barrier. The quantum formula (1), 
however, also gives the total emitted energy (in ordinary units): 


_ (dE. _ 16 v? 
E-(@ dw = 9 ae =. 
0 


PROBLEM 5. Determine the bremsstrahlung energy in the scattering of a slow electron by an atom. 


SOLUTION. With the condition pa <1 (where a denotes the atomic dimensions), the scattering by 
the atom is isotropic and does not depend on the electron energy; see QM, §132. The wave functions of 
the initial and final states of the electron may be written 


ui =e "+ fer, uy =e? T+ fe”, 
where f is the constant real scattering amplitude. These expressions pertain to the asymptotic range of 


distances r>a, which in the present case is the important range: r~1/p >a. The matrix element 
calculated from these functions Is 


pri = (27flw)(v—v’'); 


the integrals are calculated as in Problem 3. Substituting this expression in (92.12), we obtain the 
cross-section for radiation with scattering of the electron in the direction p’, in ordinary units, 


_2ap" nrg do 
se ae v) doe w ) (1) 


where doe =f’ doy is the differential cross-section for elastic scattering. When hw < p’/2m, we can take 
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p ~ p’, and this formula then becomes, as it should, the non-relativistic expression for the emission of soft 
photons; see §98.t 
Integrating (1) over the directions of p’, we obtain 


a 2ap' 2 12 dw 
dow “BF ep + v")oel — (2) 


where oa = 47f’ is the total elastic scattering cross-section. Lastly, multiplying by fw and integrating with 
respect to w from 0 to p*/2m = e, we obtain the “effective retardation” 


Krad = i hw dow = & acae(vic)’. (3) 


$93. Electron-—nucleus bremsstrahlung. The relativistic case 


Let us consider the electron—nucleus bremsstrahlung for the case of relativistic 
electron velocities.t We shall assume that the condition for the Born approximation 
to be valid is satisfied for both the initial (v) and the final (v’) velocity of the 
electron: Ze*/hv <1, Ze’/hv'<1. The charge on the nucleus must be such that 
Za <1. 

As in §92, we shall neglect the recoil of the nucleus, so that the latter acts only 
as the source of an external field; the justifiability of this treatment is discussed in 
§97. 

According to (91.4), the cross-section for bremsstrahlung is given in terms of 
the amplitude by 


ae I 4 |2 w|p'| ! 
do = On) |Mj:| pl do, do' dw. (93.1) 


In the first non-vanishing approximation, the matrix element M,; corresponds to 
two diagrams: 


fin pin (93.2) 
p’ p p’ p 


The free end q corresponds to the external field, so that q=p’—p+k is the 
4-vector of momentum transfer to the nucleus. Since the recoil is neglected, the 
time component q° = 0. 

According to the diagrams (93.2), 


= + + 
My =— PAM@)VAmyeta'(y" a y+ yy y")u. (93.3) 


+ The fact that “factorization” of the cross-section (the separation of the factor oer) occurs here for 
any w is to some extent accidental, arising because the scattering amplitude is independent of the energy. 

+t The majority of the results given below were first derived by H. A. Bethe and W. Heitler (1934) 
and independently by F. Sauter (1934). 
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The intermediate 4-momenta are f = p —k, f'= p'+k. We shall use the notation 
f?-m’?=—-2kp=-2ko, f®—m’?=2kp'=2k'o. (93.4) 
AS is the scalar potential of the external field; for a purely Coulomb field, 
Af\(q) = 41Ze/q’. (93.5) 
Substitution in (93.1) gives for the cross-section 


Z’e° lp'|o 


(ee 
4m” |plq’ 


eXe,(i'Q"u)(UQ’u’) do, do’ da, (93.6) 


where 


Qt= » Vf +m 0 oyf +m p 


YT JeK’ Fane °°? 

Q’ = y°Q’*y° 
sf END, p 2 op WT 
2aKk' 2@K } 


Disregarding polarization effects, we average the cross-section over the directions 
of spin of the initial electron and sum over the polarizations of the final electron 
and photon. This is equivalent to the substitution 


e*e,(ii'Q"u)(4Q’u’) > —3 tr Q, (yp + m)Q*“(yp'+m). 


The trace is calculated by means of the standard formulae (§22). The calculations 
are somewhat simplified by using the equation 


y°(yp)y° = yp, 


where p = (e, —p) if p = (e, p). Moreover, the number of terms to be calculated can 
be reduced by using the symmetry with respect to the change p <p’, k>-k, 
q—-—q; this simply interchanges cyclically the factors in the product of matrices, 
leaving the trace unaltered. 

The result is the following expression for the differential cross-section for 
bremsstrahlung in which a photon of a given frequency is emitted in a given 
direction and the secondary electron travels in a given direction: fT 


+ Here and in the rest of §93, p, p’ and q denote the magnitudes of the three-dimensional vectors: 
p= IPI; p'= IP’, q = {qi. 
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do = 22% pally (de 4 do’ x 
4m? pq’ w 


2 2 2 
q 2 je 2 2 a) _ =) 
x {Fe +2e"—g) +4 (~ =) -4(= + 


2 2 ' 
+209 (L-4)-2% (<++)} (93.7) 


m K K m K K 


where k =e-—n-p, k’=e'—n-p’, n=k/w, q=p'+k—p. 
By means of simple transformations, this expression can be put in a form 
somewhat more convenient for analysis: 


do = Za deo p'm" sin 6 dé sin 0’ d@' dd Xx 

27 w pq 

« {a — (4e?— q’) sin’ 0’ +e “(4e?— 4?) sin? 6+ 

2w’* Drrianay ! 2 2 
Vee sin’ 6+ p” sin’ 9’) —=PP (26? +2e"—q*)sin 6 sin 0' cos de, 

(93.8) 
where 
K=e-—-pcos 8, Kk'=e'—p'cos @’, 


gq’ = p’+p”+w’—2pw cos 6 +2p’w cos 6'— 2pp'(cos 6 cos 6’+ sin 6 sin 6’ cos @); 


6, 0’ are the angles between k and p, p’ respectively; ¢@ is the angle between the 
plane of k and p and that of k and p’. 

The integration of (93.8) over the directions of the photon and the secondary 
electron is fairly lengthy. It leads to the following formula for the spectral 
distribution of the radiation: t 


dw p' (4 p’+p ll’ 
do, = Zar? 2 fF a96' PEP + mf =a lags spi) t 
Pe Se pla pp D° lg pp’ 


See’ ee'+p’ ce't+p” 
+L| + a(e7e? + 24 mes) +—— (22 5P te tp" |} 
3pp’ ip PP opp" p> p" 


where 


+ 
L=lo gS tpp PP 
pp’—m 
e+p fajenes 2 
E—p’ E'—p 


| = log 


+ The integration over the directions of the secondary electron only can also be completed in an 
analytical form; see R. L. Gluckstern and M. H. Hull, Jr., Physical Review 90, 1030, 1953. 

Reference may also be made to the review paper by H. W. Koch and J. W. Motz, Reviews of Modern 
Physics 31, 920, 1959, in which the bremsstrahlung formulae are represented graphically. 
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The permissible values of the frequency in these formulae are limited only by the 
condition imposed on the final velocity of the electron (Ze?/v' <1): the electron 
must not lose almost all its energy. As w ~0, the emission cross-section diverges as 
dw/qw; this illustrates a general rule which will be discussed in §98. 

In the non-relativistic limit (p <m), the photon momentum is small compared 
with the electron momentum, since 


<p. 


Hence q’~(p’—p)’. Putting in (93.8) ¢ =e'=m and neglecting p, p’ and w in 
comparison with m, we find 


fir aD 
FP ee Mm sin 6 d@ sin 6’ do’ dé X 
7 @ pq 


X (p’ sin’ 6 + p” sin’ 6’— 2pp’ sin 6 sin 6’ cos ¢), 
or 


Z’a’ p' > do, do' dw 
_ F inx ea aca 
da ay D (n X q) q' a 


(93.10) 


in accordance with the Born-approximation formula derived in §92, Problem 1. 
Correspondingly, the spectral distribution of the radiation is given by the formula 
(92.16) already derived.* 

In the ultra-relativistic case, when both the initial and the final energies of the 
electron are large (ce, c«’>m), the angular distribution of photons and secondary 
electrons is very unusual. For small angles 0, 6’, the quantities x, x’ which appear 
in the denominators of formula (93.8) are 


2 2 
«~te(™ +0"), c!~4e'(Ta+ 0”), (93.11) 


and become very small in the range 0 =< m/e. In this range the magnitude of the 
vector q is also small (q ~ m). Thus, in the ultra-relativistic case, the photon and 
the secondary electron move forwards in a narrow cone with aperture angle ~ m/e. 

A quantitative formula for the angular distribution in the ultra-relativistic case is 
easily obtained from (93.8) by substituting for «x, x’ from (93.11), replacing p, p’ in 
all other places by «, ©’, and neglecting q* in comparison with e7. With the 
convenient notation 


5 = ¢6/m, 5’ = ¢'6'/m, (93.12) 


+ The derivation of this formula by taking the limit in (93.9) is somewhat laborious, however, 
because of the cancellation of various terms. 
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we can put (93.8) in the form 


da ae 2 yp? = eat do 
T eq’ 
xf Me - 6" @ 57+ 5” 
(1+ 8°) G+e 2ece’(1+ 80 + 8%) | 


_(€,£)\_ 68’ coso 
Gar ea Ee le (93.13) 


Putting q’ = (n x qg)’+(n- gq) (n=k/w), we can easily find that for small angles 


1+ 86° ey 


2 
q = 2 2 ! 
—>5 = (6° + 6" —268' cos d) +m (> 7 75! 


(93.14) 


When 6 ~ 6’~ 1, the second term in (93.14) is small compared with the first. The 
two terms become comparable at even smaller angles, where 6 ~ m/e. Although q 
here becomes particularly small (q ~ m’/e <m), the integrated contribution from 
this region to the cross-section is still small compared with that from the whole 
region 5 <1 (the ratio of the contributions is easily seen to be m/e”). But q can 
also reach values q ~ m’/e when 6 ~ 8’ ~ 1 if 


[5-8 |<mle, bX mle. (93.15) 


The contribution from this region is of the same order as the whole integral 
cross-section, or may even be the principal term in it (see below). 

The integration of (93.13) with respect to @ and 56’ gives the angular distribution 
of photons (with given frequency), regardless of the direction of the secondary 
electron:t 


pz de & 6 dé 


2 
dao = 8Z*a ets (+385 * 
e& ¢’ 45? 2ee’ lle 1687 
{fst lng Af a ae t 18) 


Integrating with respect to 5, we find the spectral distribution of the radiation in 
the ultra-relativistic case: 


yi aed 2 -5)( dee’ 1), 
do, = 4Z*ar 2 = (545-5) (log == all (93.17) 


this formula can also, of course, be obtained directly from (93.9). 
The presence of the logarithm of a large quantity (the ratio ce'/mw ~ «'/m> 1 


+ The integration over ¢ from 0 to 277 is taken first. That over 5’ is conveniently replaced by integration 
over the difference |A| = |8’ — 5|, dividing the range into parts, from 0 to some Ao and from Ao to », where Ao 
satisfies the inequalities m/e <Ao<1. In each region, appropriate approximations are possible in the 
integrand. 
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even if w ~ e) should be noted. If this quantity is so large that its logarithm is also 
large, the logarithmic terms become the principal ones in these formulae. The 
logarithm arises from integration in the range (93.15).+ Thus, in the logarithmic 
approximation, i.e. when the terms not containing a large logarithm are neglected, 
the secondary electron moves at an angle ~(m/e) to the direction of incidence. 

Finally, we shall give the limiting formula for the region near the hard end of 
the spectrum, when the ultra-relativistic electron radiates almost all its energy: 
w =~e>e’. From (93.9) we easily find 


dw («e” e'+p' m’e' e'+p' é' 
972 2 AW JE eTP Mme, ,€ Trp €& 
do, = 2Z*ar; : {Sr log e’—p’ 4p? log ae at (93.18) 


Formulae (93.17) and (93.18) together cover the whole range of values of w for an 
ultra-relativistic initial electron, and agree for w=e>e’'>m. If the secondary 
electron is non-relativistic (p’<m), then 


do, =2Z’a 


r Vi2m(s ~ 0) do (93.19) 


E 


POLARIZATION EFFECTS 


Polarization effects in bremsstrahlung can be studied by the general method 
described in §65. The choice of the 4-vectors e, e® is here particularly simple. 
Since there is only one frame of reference (the rest frame of the nucleus) which is 
of practical importance, it is sufficient to put e“ = (0, e), e® = (0, e), where e"”, 
e” are unit vectors perpendicular to k, one lying in the plane of k and p and the 
other perpendicular to that plane. 

We shall not give here either the fairly lengthy calculations or their quantitative 
results, but merely note some qualitative properties of the polarization effects.t+ 
These properties can be derived by means of various symmetry relations, as was 
done for the Compton effect in §87. 

The theory under consideration corresponds to the first non-vanishing ap- 
proximation of perturbation theory. In this approximation the cross-section cannot 
contain a term proportional only to the polarization vector ¢ of the initial electron 
or ¢’ of the final electron. The absence of a term <C means that the total emission 
cross-section (summed over the polarizations of the photon and the secondary 
electron) is independent of the polarization of the incident electron. 

Of the terms proportional to only the photon polarization parameters &}, &), &3, 


+ This is easily seen by considering the range of integration in which @ and A= 6'—6 satisfy the 
conditions m/e <A, @ <1. In this range, q’?/m? ~ A’ + $’8’, and the terms in the braces in (93.13) are 
proportional to 6’ or A’ (becoming zero when ¢ = 0 and A = 0). Integrals of the form 


db’ dpdA A’ dd dA 
W+eoy | (+8767 


diverge logarithmically; they are “cut off” at the limits of the above-mentioned range of the variables. 

+ For a fuller discussion of these effects, see W. H. McMaster, Reviews of Modern Physics 33, 8, 
1961, and V. N. Baier, V. M. Katkov, and V. S. Fadin, Radiation from Relativistic Electrons (Izluchenie 
relyativistskikh élektronov), Atomizdat, Moscow, 1973. 
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the term «&}j is absent. Thus a photon radiated by an unpolarized electron is not 
circularly polarized. Here, however, there is a difference from the corresponding 
result for the Compton effect: in the latter case such terms were forbidden by 
spatial parity because of the impossibility of constructing a pseudoscalar from the 
only two available independent vectors, k and k’. For bremsstrahlung, there are 
three independent momenta p, p’ and k, and these suffice to construct the pseudo- 
scalar k- p Xp’. A term of the form &k-p Xp’ does not violate spatial parity, and 
therefore, strictly speaking, need not be zero; but it is not invariant under a change 
of sign of all the momenta (cf. (87.26)), and is consequently absent in the first Born 
approximation. 

The existence of the pseudoscalar k- px p’ also has the result that, as well as 
the term proportional to &3, a term proportional to &; is also allowed in the 
cross-section, unlike the case of the Compton effect. This term arises as a product 
of the form 


Saplalk X Vv) gk ° p X p’ 


(where vy=kXp), which is invariant both under spatial inversion and under a 
change of sign of all the momenta. Thus the emitted photon has linear polariza- 
tion of both kinds (both along the axes e“” and e”, and in the “‘diagonal” directions 
at 45° to these axes). This refers, however, only to conditions where the direction 
of motion of the secondary electron is also recorded. On integration over all 
directions of p’, the term ~«é&; in the cross-section vanishes. This is evident from 
symmetry, since after the integration the two non-coincident “‘diagonal’”’ directions 
become equivalent, and there can therefore be no preferential polarization along 
one of them, such as occurs when €; 4 0. 

The degree of linear polarization is independent of the state of polarization of 
the incident electron: the correlation terms of the form €;¢ and &3¢ in the 
cross-section are forbidden in the first Born approximation. The term &¢, however, 
is allowed, so that the photon radiated by a polarized electron is circularly 
polarized (Ya. B. Zel’dovich, 1952). 


SCREENING 


The formulae derived above are for a purely Coulomb field. If radiation in a 
collision not with a “‘bare”’ nucleus but with an atom is considered, allowance must 
be made for the screening of the nuclear field by the electrons, which reduces the 
cross-section. For this purpose we must include the atomic form factor F(q) in the 
potential A®(q) of the external field; see QM, §139. According to QM (139.2), this 
is done by writing Z—F(q) instead of Z. We shall show under what conditions 
screening is important. 

A given value of q in the form factor corresponds to distances r~ 1/q in the 
Spatial distribution of the electron charges in the atom. The form factor becomes 
almost equal to Z (total screening) when q < 1/a, where a is the dimension of the 
atom. 

In the ultra-relativistic case, as we have seen, an important contribution to the 
emission cross-section comes from the range of values of q near its minimum 
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possible value for given initial and final energies of the electron. In the ultra- 
relativistic case, 


Amin = P — P'— ® 
= V(e?—m’)— V(e"— m’) —(e - €’) 
= m’wl2ee’. (93.20) 


Screening is important if GminS 1/a, or 
ce'/mw = am. (93.21) 


This condition is always satisfied for sufficiently large energies of the incident 
electron. 

If qmin<1/a (“total screening’) we can immediately write down, with logarith- 
mic accuracy, the spectral distribution of the radiation. The argument of the 
logarithm in (93.17) is just the left-hand side of the inequality cc'/mw > am. When 
the inequality is satisfied, the integral over q which leads to this logarithm is cut off 
at a quantity of the order of the right-hand side of the inequality. According to the 
Thomas-—Fermi model a ~ aoZ '", where ay ~ 1/me’ is the Bohr radius (see QM, 
§70); then am ~ 1/aZ"?. Thus, when there is total screening, the logarithm in (93.17) 
should be replaced by log(1/aZ""). 


ENERGY LOSS 


The energy lost as radiation by the electron is expressed by the “effective 
retardation” 


e—-m 


Pe w doy. (93.22) 


0 


The calculation of the integral, with do,, from (93.17), givest 


2 2 
Kua Zar’ {jae log etp 
Pp m 
_(Be+6p)m’?, ,e+p 4, 2m’ (ree +P) 
Zep’ log m 2" Ep e m? : (93.23) 


where the function F(€) is Spence’s function (131.19). 
In the non-relativistic case, (93.23) becomes 


Krad = 16Z’arzm/3, (93.24) 


+ Although formula (93.17) is inapplicable near the upper limit, this fact is unimportant, since the 
integral converges. 
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since F(é) = € when & <1; see (131.23). This formula can, of course, be obtained 
by direct integration of the non-relativistic Born formula (92.16). 
In the ultra-relativistic case, 
eT ree (log 2 5)5 (93.25) 
when > 1, F(é) ~3 log’ €; see (131.20). The two log” terms in (93.23) can then be 
omitted. 

The ratio k,ag/é is also called the cross-section for energy loss by radiation. It 
increases logarithmically when « is large. This increase no longer occurs, however, 
when Screening is taken into account. For total screening, k,aq/e tends to a constant 
limit ~ 4Z7ar? log(1/aZ'"). 

For a collision with an atom, it must also be remembered that some radiation 
originates from the electrons, as well as that from the nucleus. We shall see later 
($97) that, in the ultra-relativistic case, the electron—electron emission cross-section 
differs from the electron—nucleus cross-section only in that the factor Z’ is absent. 
Hence the presence of Z atomic electrons can be approximately allowed for by 
replacing Z* by Z(Z + 1). 

On passing through a medium containing N atoms per unit volume, a fast 
electron loses its energy, on average, over a distance 


laa ~ €/Nkraa ~ [Z’a Nr? log(t1/aZ"™)} "1, (93.26) 


called the radiation length. 


COHERENCE LENGTH 


Formula (93.20) may be given a different and more general interpretation. For 
the expressions derived above to be valid, it is necessary that the external field in 
which the electron moves should vary only slightly (in the direction of motion) over 
distances 


leon ~ L/dmin ~~ €€'/m’*o (= €&'/c?m’w); (93.27) 


this is called the coherence length.| The value (93.27) obtained in the Born 
approximation is actually quite generally valid for ultra-relativistic particles: it is 
easily derived in the opposite case of quasi-classical motion also, since from 
(90.22)= we see at once that the important times for radiation at small angles to the 
direction of motion are 


7 ~ e'lew(1—v)~ &€'e/m’o, 
corresponding to a section of the trajectory with length cr ~ Icon. 


+ The discussion here is due to M. L. Ter-Mikaélyan (1953). 
+t The derivation of (90.22) was based only on the smallness of the curvature of the trajectory, and in 
that sense does not depend on the fact that a magnetic field was specifically under consideration in §90. 
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For a given frequency w, the coherence length increases with the electron 
energy. However, the formulae obtained for bremsstrahlung at an isolated atom 
can be valid also for passage through a medium only if there is no secondary 
photon emission or electron scattering over a distance equal to the coherence 
length. The condition for no secondary photon emission is Ion <lrag, but the 
condition for no electron scattering is violated much sooner: there is repeated 
scattering of the electron by the atomic nuclei in the medium over a distance ~I,a¢. 

To arrive at a quantitative condition, we return to formula (90.22) before the 
integration with respect to time in the exponent and write this as 


ttt ty+r 


we {(1—)r+4 | 0° at}, (93.28) 
ty 


W 


E 
a | (1-n-v)dt~- 
ty 


=H 


where @ is the small angle between v and n which results from scattering by nuclei. 
For Coulomb scattering, @ changes by small jumps, and its time variation is 
therefore a slow “diffusion in angle’”’. The mean square deflection of the electron 
over t — t; is, in order of magnitude, 


e~ (t — ty)/Icou, 
where Icou is the mean free path for Coulomb collisions, given by 
Lloout a (NZ7e*/e) 1o2(\ max! X min); 


where Xmax and Xmin are the maximum and minimum angles of scattering in one 
collision for which the process may still be regarded as Rutherford scattering (cf. 
PK, $41). The value of ymin is determined by the atomic dimensions a, over which 
the field of the nucleus is screened: ymin ~ 1/pa. Large scattering angles are limited 
(for an ultra-relativistic electron) by distances of the order of the nuclear radius R: 
Ymax ~ 1/pR. If we put R ~ 1.5x 10°°Z'? cm ~ r.Z'", we find 


oar e* oe (93.29) 
cou " NZ’e* log(1/aZ") mm? " 


The second term in (93.28), which covers a time 7 ~ I,,.4, is now estimated as 
077 i (wele’) leon! lout cn leon! Qlrad. 
For the bremsstrahlung formulae to be valid that were derived without allowance 


for multiple scattering, this term must be much less than unity. Hence we find the 
condition 


loon < ahaa, (93.30) 
which is stronger than I|.on <laq CL. D. Landau and I. Ya. Pomeranchuk, 1953). 


+ The mean free path is determined by the transport cross-section o; = | (1—cos x) da(y). For the 


scattering of ultra-relativistic electrons by a Coulomb centre, the cross-section da(y) is given by (80.10). 
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§94. Pair production by a photon in the field of a nucleus 


The formation of an electron—positron pair in a collision between a photon and 
a nucleus (Z+y—>Z+e +e") and electron—nucleus bremsstrahlung (Z + e > Z+ 
e +) are two cross-channels of the same reaction. Rules have already been 
formulated in §91 for the transformation of formulae from the latter case to the 
former. Applying these rules to (93.8), we find the following expression for the 
differential cross-section for pair production by an unpolarized photon, averaged 
over the polarizations of the components of the pair: 


Lars ar. m’p.p- de, 


do = er sin 6, d6,- sin 6- d0_ dd Xx 


x {Pe (462 — q’) sin’ 6, ties ; (4e3 - q’) sin? 6_ — 


2 

e e 2 se e e 
(p2 sin? 6, + p2 sin? 6 )—“P#P= (262 + 262 — q’) sin 6, sin 6_ cos ot, 
KiK_ KiK_ 


(94.1) 
K+ = €+ — D+ COS 0., q’ =(p.+p_—k)’, E,+te_=W 
(H. A. Bethe and W. Heitler, 1934). 


A similar transformation derives from (93.9) the energy distribution of the 
components of the pair: 


48 
do..=Z ates de {4 2e,2-Ps P + 
P+p- 
+m (Lo a ee ee )+ 
: “pt “ psp- 
+1 $8842 (e,e2+pip-— m’e.e_)— 
3p.p- psp” % ‘ 
_ m’w (i E16-— Ps | 8?) 
2psp-\" ps ~ ps 
~ €4,€-+ ppt m? _ €4 + Ds 
7> ae. eee gr aes de = ee : 
L = log 5 or l. = log =. (94.2) 


Since the above formulae are based on the Born approximation, they are valid 
under the conditions Ze*/v. <1. The symmetry of (94.1) and (94.2) with respect to 
the electron and the positron is itself a consequence of the Born approximation, 
and would not occur in higher approximations. 

In the ultra-relativistic case (e. > m), the electron and the positron are emitted at 
angles 0.~ m/e. relative to the direction of the incident photon. The angular 


+ Polarization effects in pair production by a photon are discussed in the papers already quoted in 
§93 in connection with bremsstrahlung. 
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distribution is given by a formula similar to (93.13): 


_8, > M*e,€_ {- 84 _ 62 we 624+ 82 
Coe Oe ead Cel Gg Beye EE 82) Dele: (i+ 821+82)~ 
€+ , &-\ 646_cos@d 
+ (=+4+—=)_"“te =O 
(2 -) (ae Bosh co er ae e® (94.3) 
with 
2 2 2x2 
q_ a Se 2 9) 1 or O4 1 + =| 
mi 0+ + 5- + 26, 5- cos p +m ( Bee Oe) (94.4) 
The energy distribution in this case is 
9) ») de, 2646~ 1 Be a ie 

do = 4Z ares FH (eit et here )(log = 7) (ultra-relativistic case). 

(94.5) 


Integration of (94.5) over ex from m to @ gives the total cross-section for pair 
production by a photon having a given energy:T 


20 109 
oe $2ari(log— re rea) ssan (94.6) 


As with bremsstrahlung, the logarithmic term in the ultra-relativistic cross- 
section arises from the range of values q ~ m’/e. This now corresponds to angles 
for which 


\5,-8|<mle, |r-—$|<mle, 


instead of @<m/e as in (93.15). Thus, in the logarithmic approximation, the 
directions of the electron and the positron are at small angles to the direction of the 
photon and are almost coplanar with the direction of the photon but on opposite 
sides of it. 

Near the reaction threshold (w >2m), the Born approximation is invalid. The 
derivation of a quantitative formula in this case would require an exact calculation 
of the Coulomb interaction of the three charged particles (the nucleus and the pair) 
in the final state. The symmetry with respect to the electron (which is attracted to 
the nucleus) and the positron (which is repelled from the nucleus) is then, of 
course, lost. 

If 


Za < (o28< 1. (94.7) 


+ Since the integral converges at both limits, the inapplicability of formula (94.5) for small values of 
€+— m is not important. 
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the Born approximation is still valid. At non-relativistic energies of the pair, 
w ~2m > p., and therefore gq ~ w. In (94.1) we can everywhere put e2j=Kk.=™m, 
w = 2m, and this formula then reduces to 


_ Zar? psp- 


do = 6477 m 


(p*. sin’ 6, + p2 sin’ 6_) do, do_ de.. (94.8) 


After integration over angles, 
(pit p2 
ee fee pep peter des 


2Z’ ar? 


ae oa 2m)V[(e,—m)(e_— m)] de,. (94.9) 


Finally, integration over c, from m to w —m gives the total cross-section 
me) 3 
o =F Zari(C—) | (94.10) 


If the relative velocity (vo) of the components of the pair formed is small, their 
Coulomb interaction must be taken into account (A. D. Sakharov, 1948). This 
interaction becomes important when vo is of the order of (or less than) the 
velocities of the particles in the bound state of the electron and positron (positronium): 


Vo Sa. (94.11) 


Let us consider the process in the centre-of-mass system of the pair. Virtual 
momenta ~m are important in the diagrams which represent the process in this 
system; that is, distances ~1/m between the electron and the positron are im- 
portant. The wave function W\(r) of their relative motion changes appreciably only over 
distances r ~ 1/mvo ~ 1/ma, which are large compared with 1/m. The allowance for 
the interaction of the particles therefore amounts to the inclusion of a factor #*(0) 
in the transition matrix element. The differential cross-section is accordingly 
multiplied by |(0)|’, i.e. by 


2 
ea (94.12) 


see QM, (136.11). The relative velocity of the two particles is the velocity of one 
particle in the rest frame of the other. Comparing the values of the invariant p,,p* 
here and in the laboratory system (the rest frame of the nucleus), we find 


2 


m 
Van. oo Pe 


whence vo may be found. If p, and p_ are similar in magnitude and direction, vp is 
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given by the approximate formula 
vp = po’ + (ps — p-\e? (94.13) 


valid for vp <1; here p = (p.+p-_), € =xe€,+&-_), and 9 is the angle between p. 
and p-. 

The correction to the cross-section according to (94.12) and (94.13) causes an 
anomaly in the correlation between the momenta of the electron and positron 
formed: it has a narrow maximum at p; ~ p-. 


§95. Exact theory of pair production in the ultra-relativistic 
case 


In §§93 and 94 we have discussed bremsstrahlung and pair production by a 
photon in the relativistic case, using the Born approximation, for which the 
condition Za <1 must always be satisfied. In §$95 and 96 we shall describe a theory 
of these processes which is not subject to the limitation just mentioned, i.e. is valid 
even if Za ~ 1 (H. A. Bethe and L. C. Maximon, 1954). We shall assume that both 
the particles (the initial and final electrons, or the constituents of the pair) are 
ultra-relativistic, with energy « >m. 

We have seen that in the ultra-relativistic case both particles move at small 
angles (6, 6’ or 6,,6_) to the direction of the photon: @=<m/e. This property is 
preserved in the exact (with respect to Za) theory, and we shall therefore consider 
just this range of angles. 

The momentum transfer to the nucleus in this range 1s q ~ m. This means that in 
the wave functions the important values of the impact parameter are p ~ 1/q ~ 1/m, 
i.e. “large’’ distances. At such distances the wave function derived in §39 can be used. 
The calculations for pair production are as follows. 

The pair production cross-section is similar in form to the photoelectric effect 
cross-section (cf. (56.1), (56.2)): 


2. ] : 7 d’p, d’p- 
ee eV (4) 5m Mi Bor — 6-2.) BSP (95.1) 
where 
Ma = | wo* (a eve Wp, d (95.2) 


Here w°’,. is the wave function of the electron, and p“?, -,, the wave function 
with negative energy —e, and momentum —p,. 

The function wS”,., which pertains to a particle in the final state, must have an 
asymptotic form which includes (besides the plane wave) an ingoing spherical 
wave; this is indicated by the superscript (—). According to (39.10), this wave 


QE4 - BB 
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function ist 


po <= om (1 - 3 VF (-ip 1, -i(p_r+p_-r))u(p-_) 
—» p- V(2e-_) Qe_ 9 *9 (95.3) 
C=e"™'T1 +i), vy = Za. 
The function wy, _», must have an asymptotic form which includes an outgoing 


spherical wave (indicated by the superscript (+)), since it denotes the wave 
function of an “initial state with negative energy’. The asymptotic form of the 
wave function of the positron, obtained from w°* _p,, then has an ingoing wave, as 
is correct for a final particle. According to (39.11), this function is 


(+) Co 1a : V 


= —ipy cr —] 1 ° = 
Wun Tae (1+ — )F iv, 1, (part ps nul | = 
CP =e "T+ iv). 


The terms ~1/e in (95.3) and (95.4) have to be included because of the matrix 
structure of M;; (95.2). The matrix element (a),; is a vector whose direction is close 
to that of k. The leading term in (a- e),; is therefore small, and the correction terms 
are of the same order of magnitude as that term. 

Substituting (95.3) and (95.4) in (95.2) and neglecting terms ~1/e.e_, we find 


Ma = a7 pre WP Mle a) + (e- aN(a-L.)+(a-Lye-au(-p.), 5.5) 


where 


N =C%C® = wp/sinh mv, (95.6) 


I = |e“ "FEF, d°x, 


I, = | e-eFAVR, d°x, 


l 
Be (95.7) 


_ to —iqer * 3 
ie se (VF_)*F, d°x, 
q = p.+p-—k; 


F_ and F, are used for brevity to denote the hypergeometric functions which 
appear in (95.3) and (95.4). The integrals I, L., I satisfy one identical relation: from 


| Ve PERF, d°x =0, 


+ In this section, ps = |p-|, q = |q|. 
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we have 

ql + 2€.1,+ 2e_L = 0. (95.8) 
We average |M,;|* over polarizations of the incident photon, and sum over direc- 
tions of the electron and positron spins.t This is done by the tensor substitution 


eet > (Six —nh), n= k/w, 


and changing the bispinor products according to 


UsUs > 2ps = (€sY 
Putting also a = y’y, we find 


|My; |? ae: (N?/2e.e_){tr p-Q: p+Q —tr p-(n-Q)p:(n- Q)}, 
Q=y!1- yyy - 1) - yy - Ly, 
Q=yl*- yy -Dy- yyy: I*). 


The final result, obtained after making the appropriate approximations, for the 
ultra-relativistic case at small angles 


6.~mfi/e <1, (95.9) 
will be given here. We define the auxiliary vectors 


6, (95.10) 


Where the suffix | denotes the component perpendicular to the direction of k. Then 


moO4 


MiP >IN~Z es rnp +2|r 2 +1, 


+ 2[r Bea , (95.11) 


where we have used the fact that I ~ eI./q ~ cIl./m (as is seen from (95.8)), and 
terms of higher order in m/e are omitted. 
The integrals I. may be expressed as 


_. ps 95 
Dee dps’ 


—iq-r 
J -[£ — F(-iv, 1, i(par + pe) F (iy, 1 i(p-rtp-+n))d’x. (95.12) 


+ Calculations with allowance for the polarizations of all the particles are given by H. Olsen and L. 
C. Maximon, Physical Review 114, 887, 1959, and in the book by Baier et al. cited in §93. 
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The integral J can be written in terms of the complete hypergeometric function:f 


_4n d,_2Ba)" tie 
r= Gare F(-iv, iv, 1, 2), 
(95.13) 
y=? q'(p+p-— p+: p-) + 2(ps - q)(p-° q) 
(q’ — 2p. q)(q°—2p-- q) 


The differentiation with respect to p. must be carried out with q fixed, only 
thereafter putting q=p:+p_—k. The result, after making the approximations 
corresponding to the ultra-relativistic case and the conditions (95.9), is 


L= 


iv Z 
4 (ES) [a rgeFoti Groat-—msy}, 5.14 


with, for brevity, the notation 


2 
é=r, 802 = 1-44 BE, 
1+ 6: m* (95.15) 


F(z) = F(-i, iv, 1, z), 


F(z) being a real function. The integral I is then found immediately from (95.8). 
Substituting the values of the integrals in (95.11) and thence in (95.1), we find 
the required cross-section: 


do = = 


T 


TV 
sinh av 


2 m4 
) Zari Ms 5, dB, - 8 d8_- dé de. x 


x {F%(z)[-2e+6-(628% + 62 €&) + w°(84 + 82)E,E + 
+ 2(e% + €7)5,5_E,€ cos d] + 


q* EE" 
+ ad P'(z)[-2e +e (Si Es + 82") + w'(1 + 55 8°)EE — 


—2(e4 + €7)5.8_&,€ cos 61}. (95.16) 


When 1» > 0, 


TV 


sete as = ’ — 2 
sinh aaa F(z)> 1, F'(z)=v’°> 0. 


The expression (95.16) then reduces, as it should, to Bethe and Heitler’s formula 
(94.3), which corresponds to the Born approximation. It also reduces to this 
formula for any v if the angles of emission of the pair satisfy the conditions 


\s,-5|<1, |ar—$|<1. 


t+ The calculations are given in Nordsieck’s paper quoted in §92. 
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For then q <m, so that the second term in the braces in (95.16) can be omitted 
because of the extra factor (q/m)* as:compared with the first term, and in the first 
term we have (since 1 — z ~ q’?/m’* K1)t 


F(z) > F(1) = F(-iv, iv, 1, 1) 


7 1 
~ Fd — iv) + iv) 


_ sinh ub (95.17) 
TV 
so that the similar factor in front of the braces is cancelled. 
Let us now consider the integration of the cross-section over the directions of 
emission of the pair. The integration over angles is divided into two regions I and 
II, in which we have respectively 


(d) 1-z>1-2, (I) 1-z<1-2Z, 


where z, is a certain value such that 1 > 1—z,>(m/e)*. Since in region II 1—z <1, 
q’<m_’, it follows from the above discussion that in this region do ~ doz = do,-o, 
where doz is the cross-section in the Born approximation. The integral over angles 
is therefore 


do,.={ do =| do+ | doy 
I it 


=i. ed | (ag Se 2): (95.18) 
{ 


where (do,+)z 1s the Born cross-section (94.5) integrated over angles. 
In region I we have 


q?|m? = 82. + 82. +28,8_ cos ¢. 


We shall change from the variables 5,, 6_, @ to &, &-, z. A direct calculation of the 
Jacobian for this transformation gives 


e,e- d&, dé_ddb 


54 dé, -6_ d5_: dd se 8m2 (€,é_) sin db’ 


where 
LZ (q’?/m*)é,é- 
=€,+é -2&é +2V(&E(1— &)0—- €)] cos 6. 


+ This value of the function can be obtained from QM, (e.7), which relates hypergeometric functions 
with arguments z and 1 — z. 
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Expressing sing and cos@ in terms of the other quantities by means of this 
equation and substituting in (95.16), we obtain after some simple algebra 


7 2dé, dé dz 
dg" Ade a= ned Dee 2e-e 


. lanep th (ei + €2)(1 — z) + 2ese_(&, — €)7]+ 
eae Fe) [(e3 + e2)z + 2e,e_(€, + & — yi}, 


A=( Tv ) Fare 


sinh wv) 270° 
Finally, we replace &, and &- in terms of new “spherical” variables y and w: 


&,+€ -1=Vzsiny cos yf, 
€&,-& = V(1-z) sin y sin ti, 
O0<y<in7, 0<yW<2z7, 


2 dé. dé. > V[z(1 — z)] sin x cos y dy dw. 


These ranges of variation of y and w correspond to the range 0 to 1 for & and é-, 
i.e. to the range 0 to © for 6, and 6_- (or, equivalently, 6, and 6_); the rapid 
convergence of the integral allows the range of variation of the angles to be 
extended in this way. After the transformation, the root in the denominator 
becomes V[z(1—z)]cos y; the integration over x and & is elementary, and the 
result is 


2 
do =2A -2ndz(ei+e2+ ge.0)(52 +5 F'2)| dex. 
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An extra factor 2 has been included because the integration over z is to be taken 
from 0 to z,, whereas, when the azimuth @ varies from 0 to 7 and from 7a to 27, 
each value of z occurs twice. 

The integration over z is effected by means of formula (92.14), which, for 


v'=—v (and F(z) accordingly real), becomes 
BP... Buse Tad 
fs ee = Fi yy CEP). 


The integral of this expression is z,;F(z,)F'(z,)/v?. The value of z,F(z;) ~ F(\) is 
taken from (95.17), and the limit of F’(z, > 1) is given byt 


gion 


~ (log(1 = z) + 2f(v)] B=, 


where 
f(v) = 7VC + iv) + VO — iv) -2V(1)] 
pie 1 
A n(n’?t+ vy’ 
W(z) =T'"(z)/T(z). 


(95.19) 


Substituting the above expressions in (95.18), we obtain as the final formula 


des 


do. =4Z’arr(e2 + © + he e- [oz ar - faz) (95.20) 
The total cross-section for pair production by a photon with energy w is 
og =3Z'a rillog 2-4 flaZ)|, (95.21) 


We see that the only change in these formulae is that a universal function f(aZ) of 
the atomic number is subtracted from the logarithm. Figure 18 shows a graph of 
this function. For v <1, f(v) ~ 1.2v”. 


$96. Exact theory of bremsstrahlung in the ultra-relativistic 
case 


The matrix element for the bremsstrahlung process is 


= | WO (a-e*) ey dx; (96.1) 


+ The derivation of this formula is in the Appendix to the paper by H. Davies, H. A. Bethe and L. C. 
Maximon, Physical Review 93, 788, 1954. 
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the wave functions of the initial electron (¢, p) and the final electron (e’, p’) include 
respectively outgoing and ingoing spherical waves in their asymptotic forms. The 
calculation of this integral is similar to that of the matrix element (95.2), but will not 
be given here. Instead, we shall describe another way of calculating the brems- 
strahlung cross-section, based on the fact that the process is quasi-classical, and 
not using the explicit form of the wave functions of the electron in the field of the 
nucleus (and in this sense independent of the precise form of the field potential)(V. 
N. Baier and V. M. Katkov, 1968). 

In the bremsstrahlung process, the nucleus transfers to the electron and the 
photon a momentum q=p’+k~—p. As in the pair production problem, we must 
distinguish two ranges of values of the transfer q, which is transverse relative to p: 


(I) m=q,>om’/e*, (1D q,~wm?/e*<m. (96.2) 


It is evident that in region I the emission cross-section is equal to the Born value: 
for these values of q,, the recoil momentum of the nucleus is unimportant, as will 
be shown in §98 (see the derivation of the condition (98.10)). In region I, the 
cross-section for the process is therefore the product of the exact cross-section for 
electron scattering in the field of the nucleus at rest and an emission probability 
which is independent of the form of the field. But since, according to (80.10), the 
exact cross-section for scattering at small angles in a Coulomb field is equal to the 
Born value, so is the cross-section for the whole process in region I. 

Thus only region II need be considered. Small momentum transfers correspond 
to the passage of the electron at large distances from the nucleus: p ~ 1/q, ~ e/m’. 
But, at these distances, the motion of the electron is certainly quasi-classical, as is 
easily seen by direct application of the usual quasi-classical condition, QM (46.7), to 
the ultra-relativistic equation (39.5). 

Since the motion is quasi-classical, we can use the method already applied in 
§90 for synchrotron radiation. The expression (90.7) is in this case the probability of 
emission when the electron passes the nucleus once. 

Formula (90.18) remains valid for the function L used in §90; the only 
difference is in the form of the quasi-classical electron path r= r(t), which is used 
to calculate the difference r,—r;. 

At large impact parameters, the field of the nucleus may be regarded as weak. 
In the zero-order approximation, the path is a straight line passing at a distance p 
from the centre. In the next approximation, we have as the equation of motion (cf. 
Mechanics, §20) 


where p is a vector lying in the xy-plane and perpendicular to the initial momentum 
of the electron, and r on the right-hand side is to be taken as the zero-order 
function: 


r=V(p'?4+ vt?) = V(p?+t’). 
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Hence 


t 
dU dt 
p(t)—pi=—p | ee (96.3) 
ty 


The velocity v(t) = p(t)/e, where the energy e« depends on the magnitude but not 
the direction of p, may be regarded as constant with sufficient accuracy. A further 
integration then gives 


Sr eee -+ [p(t’) — pi] dt’. (96.4) 


We shall take t; =—, so that the quantities p; = p(-~)=p and v=p/e are the 
initial momentum and velocity of the electron. 
We can put the probability (90.7) in the form 


dw = |a(p)|’ d’k/(27)’, (96.5) 
where 
21 . € 
a(p) = e/2% | R(t) expyi ~ (wt —k- r(t))| dt, 
Ch ee ee Jes (96.6) 
(2e’) (Qe) 
and «'=e-—a, p(t) = p(t)—k. The classical function p(t) is given by (96.3). If p 
denotes the initial momentum of the particle, we have for a Coulomb field (U = — /r, 
v = Za) 
nee t 
p(t) = p~ 28 | ay +1], 
and 


rab t-te (Veo + 1) +t), 


In terms of the change of momentum in classical scattering, 
A = p(~) — p(— ©) = — 2pu/p’, (96.7) 


we can rewrite these formulae as 


w= p+ial ere ps +! 


i (96.8) 
r(t) =(p +4a)* +5~V(t?+ 9%). 
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Now using formula (90.20) for R(t) and the expressions (96.8) for p(t) and r(t), 
we can calculate the integral with respect to time in (96.6). The integration is carried 
out by replacing the variable t by 


§=—5 (wt —k- r(t)) 


and using the formula 


—— 
Tere gy = 2ixKiWo., 


where K, is the Macdonald function. There is, however, no need to complete these 
calculations, since we want the expression a(p) only for small values of A (A <m), 
an independent parameter. Then we obtain 


a(p) = wfDw; - AyK,(y), (96.9) 


where 
WE 
x=p77(U-n-y), 


n = k/w, and D is some function of p, « and k (but not of p), whose precise form is 
unimportant.t Since, in the ultra-relativistic case, the photon is emitted at a small 
angle @ to the direction of the electron velocity, we have 


x~p—o(1—v +16") 


or 
= Raat (1 5’) 5 = 0e/ (96.10 
X=Pp =f a : = OE;/M. 96.10) 


It has already been mentioned that (96.5) is the probability of photon emission 
in a single passage of an electron past a nucleus at impact parameter p. The 
cross-section for the emission of a photon with given frequency and direction is 
obtained by multiplying by v~' dp, dp, ~ dp, dp, =d’p and integrating with respect 
to the impact parameter: 


3 
do = ss | |a(p)|’ d’p. (96.11) 


+ The spinors w; and wy may be taken as constant in the integration, i.e. the change in the electron’s 
polarization in its classical ultra-relativistic motion may be neglected. This can be seen from the 
equations derived in §41. 
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However, it should not be thought that this formula without the integration over 
d*p would also give the directional distribution of the final electrons. The deviation 
of the electron in motion in a classical path, which is uniquely determined by the 
external field, is certainly not the same as the indeterminate quantum-mechanical 
deviation (and the limit p’() of the classical function p(t) is therefore also 
different from the actual final momentum of the electron). Consequently, in order 
to obtain the angular distribution of the electrons, we must re-expand their wave 
function in plane waves. 

It is seen from (96.11) that a(p) is the amplitude of photon emission in a passage 
at impact parameter p. The expressions (96.5) and (96.6), however, define this 
amplitude only to within a phase factor, which is easily seen to be e ‘*'®: on 
account of the time-independent term r,(~) =p in r(t), this constant factor must 
be present in V;(t), and may be taken outside the time integral. Since it is not an 
operator, it is not affected by the process of commutation, and the amplitude for 
the emission process is thus 


e'* °a(p), (96.12) 

where a(p) is given by (96.9). 
Now let the electron be described, as z >— ©, by a plane wave with momentum 
p along the z-axis. This means that the wave function of the electron as z >—™, as 


a function of x and y, reduces to a constant, which can be taken as unity. Then the 
wave function of the electron which has passed through the field is, for z > ©,T 


ys(cc) = S(p) = exp{- i | U(x, y, Z) az}. (96. 13) 

According to the significance of the transition amplitude (96.12), the wave function 
of an electron which has passed through the field and emitted a photon is 

e'*'a(p)S(p). (96.14) 


The amplitude for emission in which the electron goes to a state with definite 
momentum p’ is given by the corresponding Fourier component of (96.14), 1.e. by 


a(qi) = | ee °a(p)S(p) ap 
= |e" Pa(p)S(p) do, (96.15) 


where q, is the transverse component of the momentum transfer to the nucleus; cf. 


+ See QM (131.4); we have in mind the analogy between equation (39.5) (in which we put p’ ~ e’) 
and the non-relativistic Schrédinger’s equation (39.5a). Bearing in mind the difference in the significance 
of the coefficients in these equations, it is easily seen that in our case the conditions QM (131.1) for the 
formula QM (131.4) to be valid are in fact satisfied. The fact that this formula is not valid for arbitrarily 
large z is unimportant, for the same reasons as in QM $131. 
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QM (131.7). The cross-section for scattering with a given transfer q, 1s 


3 2 
do = |a(q,)/ on oe (96.16) 


Let us now calculate S(p). In the case of a Coulomb field considered here, the 
integral in the exponent diverges, in accordance with the phase divergence in 
Coulomb scattering. The integral must therefore be first calculated between finite 
limits: 


R R 
= dz 
; U dz _ 2v | ver +7) 
= —2p[log(R + V(R’ + p’)) — log p] 
= —2y log 2R + 2v log p 


(R > p). The first term, which is a constant, is unimportant, and therefore 
Sip) =e er Say, (96.17) 


Substituting (96.9) and (96.17) in (96.15) and integrating over the directions of 
the vector p in the xy-plane, we find 


[e,2) 


aq » |p? KiGdI(aup)e dp. (96.18) 


0 


where J; is the Bessel function. The factors not involving v = Za have not been 
written here. 

We see that the dependence of the amplitude a(q_,) (and therefore of the 
cross-section (96.16)) on v is contained in a separate factor. On the other hand, 
when v > 0, the cross-section must tend to its value in the Born approximation. It is 
therefore immediately clear that the cross-section will differ from the Born value 
only by a factor which is independent of the electron polarization and hence does 
not influence the polarization effects. 

The integral (96.18) can be expressed in terms of the hypergeometric function 
by means of the formula 


r on! acl 2 ~145 2 
| Gon Oowe= Lea ee) (1+2,) F(R, 1-4, ee ) 


0 ae ave t+ b2 
0 


This gives 


a(q:) « v(1— iv) Gqy? T°) — iv) F (iv, 1 — iv, 2, 2), (96.19) 
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where 


z=1- _mo” 43 aye. 5 = c6/m; (96.20) 


m‘o 
4q ae T 
here we have used the fact that, in region II (see (96.2)), the component of the 
vector gq parallel to p is 


: wires 
qi=@?—-qi~qa-a(lt 6. (96.21) 


This is easily proved, since in that region the angles between the momenta p, p’ and 
k satisfy the conditions (93.15). 

The hypergeometric function in (96.19) can be reduced to the function F(z) in 
(95.15) by means of the formula 


F(a,b+1,¢+1,2)=—— F(a, b, c, 2) + FS Fa, DeGo7). 


The final result is then 


teas: Fo Por 2)" Bg |, (96.22) 


where dog is the Born cross-section (93.13) (H. A. Bethe and L. C. Maximon, 
1954). When q>m7’/e, we have z~1, and the whole coefficient of dag tends to 
unity; in this sense formula (96.22), which has been derived for region II, is 
automatically satisfied for all gm. When q=<m7’/e and the correction factor in 
(96.22) is different from unity, the vectors p, p’ and k are almost coplanar, and the 
quantities 6 and 6’ are almost equal; this has already been taken into account in 
(96.22). Thus q’ in the expression (96.20) for z can be written as 


@? te? 
+, = 6°+ §”—288' cos ¢ Tie ol (1+ 87)’, (96.23) 


= 


i.e. we can put 5 = 6’ in the second term in (93.14), but not in the first term, which 
does not contain a small coefficient (~m?/e”). 

To find the cross-section integrated over angles, there is no need to repeat the 
integration: we can proceed as follows (H. Olsen, 1955). Various directions of p’ 
(for a given energy «') correspond to degeneracy of the final state of the electron. It 
is evident that the result of summing over states which belong to one degenerate 
level is independent of how the complete set of these states is chosen. We can 
therefore use, in summing over directions of p’, the set of functions p{) instead of 
the wS?, which are needed in calculating the differential cross-section, i.e. we can 
define the bremsstrahlung matrix element as 


Mii = | Wea e* em TS dex. 
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This integral is easily seen to be the same as (M#?)* if the parameters of the wave 
functions in the latter are changed as follows: 


P+,P+5€:>—P,—P,—€; — P-, P-,E-> Ps P,£5; k>—k 


and the sign of the integration variables is reversed: r>—r. 

Hence it is clear that the bremsstrahlung cross-section integrated over angles 
can be obtained from the integral pair production cross-section (95.20), on multi- 
plication by 


w’ dw w’* da 


~_{_~ 


pi de, ede 


(cf. (91.6)) and replacement of ¢, by —«, e- by e’. Thus we have 


do = 4Z?ar? = (E +£-5)/los = — 5 - f(az)| (96.24) 
We see that the corrections to the Born formulae for the integral bremsstrahlung 
and pair production cross-sections are given by the same function f(aZ). 

Formula (96.24), which does not depend on any limitations on the value of Za, 
allows a passage to the classical limit (h > 0, Za >). In this limit, we must also put 
e ~ e’. Bearing in mind the asymptotic expression WV(z) ~ log z as |z| > and the 
value V(1) =— C (where C is Euler’s constant), we find the effective retardation 


2 2 
a 16Z*ree" ef 2€ 


ae Oe cnze. 2. c| dw. (96.25) 
This expression, which does not contain h, is the classical frequency distribution of 
the bremsstrahlung intensity. 


$97. Electron-electron bremsstrahlung in the 
ultra-relativistic case 


Electron—-electron bremsstrahlung is represented by eight Feynman diagrams: 
four diagrams 


pg ee p, . a ae 
| , (97.1a) 
| 
py. ee eS 0 Pe ee ae 
pi ——.—-_=— P p, saa eae 
| 
| | 
0; ; P, D’, = (97.1b) 
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and four “exchange” diagrams obtained from those shown by interchanging p; and 
p». Here we shall give the results of the calculations for the ultra-relativistic case 
(G. Altarelli and F. Buccella, 1964; V. N. Baier, V. S. Fadin and V. A. Khoze, 
1966).T 

In the laboratory system (the rest frame of one of the initial electrons, say the 
second), the emission cross-section integrated over the directions of the photon can 


be written as a sum do = do") + do™, where 
do” = dar? #2 £— 8 (_£_ 4 £— 9 _ 2) (Ig 280) _f), (97.2) 
do = jar? wee {(4 — = + m,) log = gta at = aad form 22m, (97.3) 
do = jar? te {8(1 — = + 2.) log m “2 
“RnB - 8g at] 
— 4 m9 2 48 sor wy <hm (97.4) 


(e being the initial energy of the first electron). 

These formulae are accurate as far as terms of relative order m/e. To this 
accuracy, it is found that the contributions to the cross-section from different 
diagrams do not interfere, and in this sense do? and do™ correspond to emission 
by each of the two electrons: the fast electron and the recoil electron respectively, 
diagrams (97.1a) and (97.1b). 

The “‘exchange”’ diagrams give the same contribution to the cross-section as do 
the “‘direct’’ diagrams. Since the electrons are identical, the total contribution from 
the direct and exchange diagrams has to be halved, and we may therefore consider 
only the contribution of the direct diagrams and ignore the identity of the particles. 
For electron—positron collisions the exchange diagrams are replaced by annihilation 
diagrams, but their relative contribution is of order m/e and therefore negligible. 
Hence the bremsstrahlung cross-sections are the same, to the accuracy indicated, 
in electron—electron and electron—positron collisions. 

For w >m, the ratio 


i.e. the emission from the recoil electron is small compared with that from the fast 
electron; when this ratio becomes of the order of m/e, formula (97.3) is of course 
no longer meaningful. When w<m, on the other hand, the two parts of the 


+ The calculations are given in the book by Bafer et al. cited in §93. 
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cross-section are almost comparable: 


2 
do? = Par? Go log cee 
63) M@ 
w <m. (97.5) 
216, .2dM,  & 
do = sare ~ log —, 


For formulae (97.2)-(97.5) to be valid, it is necessary that at least one of the 
electrons should remain ultra-relativistic after emission of radiation, i.e. the photon 
frequency must be sufficiently far from the hard boundary of the spectrum (the 
maximum frequency @max that can be emitted). The final energy of the electrons is 
least, and the photon energy greatest, when both electrons move, after emission, in 
the direction of the photon at equal speeds. The conservation laws then give 


SN = Oia LE 5 |p| = @max + 2|p’]. 


Hence, eliminating ¢’ and p’, we have 


(e lf See Wmax) = ({p| = Wmax)” = 4m’ 
and 


_ m(e—m) 
max ~ mte- Ip} (97.6) 


When ¢ >™, @max ~ €. Thus formulae (97.2)-(97.4) are valid if 
Wmax- @ ~E~W>M. (97.7) 


The cross-section (97.2) for emission by the fast electron is exactly equal to that 
for electron—nucleus bremsstrahlung when the nucleus has Z = 1 (formula (93.17)). 
This agreement is not fortuitous, and can be explained by considering the 
significance of recoil in the emission process. 

In deriving (93.17) we neglected the recoil of the fixed particle (the nucleus), 
replacing it by a constant external field. This was equivalent to neglecting the time 
component of the momentum transfer 4-vector q = p'’—p+k (the recoil energy). 
We shall show that, in the ultra-relativistic case, this treatment is permissible for 
electron—electron as well as electron—nucleus bremsstrahlung. 

We write 


—q’=—(e'+w—e)+(pjt@— py)’ +(pi- pi)’, (97.8) 
where the subscripts indicate the components of the vectors p’ and p (the initial and 


final electron momenta) parallel and perpendicular to the direction of the photon k. 
In the ultra-relativistic case the angles 0, 6’ between k and p, p’ respectively are 
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small: 6 <= m/e, 0’<m/e'. Hence 


|p. = |p|@ ~mM, 
2 2 2 


and similarly for p; and pj. 
Neglecting recoil, we have e’+ w —« =0; the term pj+ @ — pj ~ m’/e, and so 


—q’~(pi— pi) ~ m’, (97.10) 
The energy of (electron—electron) recoil is 
qo=e'+a-—e~q'/2m~m. (97.11) 


The change in p; due to the change in «’ is negligible. The change in q’ with 
allowance for recoil, which we donote by Aq’, is therefore given by the first two 
terms in (97.8). Using (97.9), we have 


2 12 Z 2 
dg?~(e'+o-e)(— Be +P 


E E E 


~m’*- mle. 


Comparison with (97.10) shows that Aq’*<|q’|, and the neglect of the recoil is 
therefore justified.t 

The fact that the fast particle emits into a narrow cone (with aperture angle 
~m/e) in the direction of its motion enables us to deduce the cross-section in the 
centre-of-mass system by a simple conversion of the cross-section (97.2) from the 
laboratory system.+ 

In the centre-of-mass system the two electrons emit in the same manner, each 
in the direction of its motion. (It may be noted that this gives an intuitive 
explanation of the absence of interference between the radiation from the two 
particles.) The energy E of the ultra-relativistic electron in this system is related to 
its energy « in the laboratory system by 2E” = me; the respective photon frequen- 
cies and w are related by w/e =Q/E. These equations are easily obtained by 
comparing the values of the invariants (p,p2) and (p,k) in the two systems. The 
cross-section for emission by each electron in the centre-of-mass system 1s there- 
fore 


do = do® 


= P| E E-Q 2)(1 se Sa 
eae m2, 2)" 


a (97.12) 


+ This conclusion is, of course, valid a fortiori in the case of electron—nucleus bremsstrahlung, for 
which the recoil energy qo ~ q’/2M ~ m’/M, where M is the mass of the nucleus. 

+ In general such a conversion is not possible, because the contribution to the spectrum in a given 
frequency range dw comes from photons emitted in quite different directions. 


QE4 - CC 
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For (97.12) to be valid it is also necessary that the photon frequency should not 
be close to the boundary of the spectrum. For an ultra-relativistic particle, the 
above-mentioned transformation gives immediately, when Wmax ~ €, 


Omax ~ OmaxE/e ~ E. (97.13) 


Thus, in the centre-of-mass system, the electrons can emit only half of their total 
energy 2E. A direct calculation of Qmax is easily performed by noting that, after the 
emission of such a photon, the electrons will move (in that system) at equal speeds 
in the direction opposite to that of the photon. We have 


26 = 2E +O as. 2|p'| = OQmax; 
whence 
Omax = p’/E = E-m’/E, (97.14) 


and in the ultra-relativistic case again (97.13). Thus formula (97.12) is applicable 
under the condition 


OLR = OS ESO. (97.15) 


We shall now give some formulae for emission in the centre-of-mass system in 
the opposite limiting case, near the boundary of the spectrum, whent 


Omax — << m. (97.16) 


Since in this case the recoil is very important, the results differ from those for 
scattering by a fixed centre and are also different for electron—electron and 
electron-positron scattering (V. N. Baier, V. S. Fadin and V. A. Khoze, 1967). 

In electron—electron scattering, besides the squares of the diagrams (97.1), there 
is also a contribution to the emission cross-section near the boundary of the 
spectrum from products (interference terms) of the direct and exchange diagrams, 
in which a given initial particle emits, for example, the product of the second 
diagram (97.1a) and the diagram 


This is because, near the boundary, the final particles have similar momenta and 
there is no reason for the exchange terms to be small. The final result for the 


+ The result obtained in the Born approximation is, of course, as usual valid only if the relative 
velocity of the final electrons is large in comparison with a. If not, the interaction of the particles in the 
final state has to be taken into account. 
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cross-section is 


EQmax- OP dO 


do = 2ar; 7 0, (97.17) 


In electron—positron scattering, a iogarithmically large contribution to the emis- 
sion cross-section comes from the squares of annihilation diagrams, in which there 
is emission by the initial particles: 


k 

| | 
A 

op ee ae ap ee en eas (97.18) 

Spies ha ps See ee 


The squares of other diagrams are significant when the accuracy is not logarithmic, 
but the interference terms are small. The final result is 


_ ; Ema — OF ( 2E.\ dO 
ie Ter a log — +1) = (97.19) 


Thus the emission in electron—positron scattering is logarithmically large in com- 
parison with that in electron—electron scattering. 


$98. Emission of soft photons in collisions 


Let doy be the cross-section for a given process of scattering of charged 
particles, which may be accompanied by the emission of a certain number of 
photons. Together with this process, we shall consider another which differs from it 
only in that one extra photon is emitted. If the frequency w of this photon is 
sufficiently small (the necessary conditions will be formulated below), the cross- 
section do for the second process is related in a simple manner to do». 

When w is small, we can neglect the influence of the emission of this quantum 
on the scattering process. The cross-section do can therefore be represented as a 
product of two independent factors, the cross-section do» and the probability dw of 
emission of a single photon in the collision. The emission of a soft photon is a 
quasi-classical process; the probability is therefore the same as the classically 
calculated number of quanta emitted in the collision, i.e. the same as the classical 
intensity (total energy) of emission dI, divided by w (= hw). Thus 


do = doy dl/w. (98.1) 


We shall show how this formula can be derived from the general rules of the 
diagram technique (J. M. Jauch and F. Rohrlich, 1954). 

The diagrams for the process involving an additional photon are obtained from 
those for the original process by adding an external photon line which “branches 
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off” from some (external or internal) electron line, i.e. by replacing 


kK\ 
\ 
m \ (98.2) 
p p-k p 


It is easily seen that the most important diagrams will be those in which this change 
is made in external electron lines. For, if p is the momentum of the external line 
(p?=m_’), then for small k we have also (p—k)’ =m’, ie. the factor G(p —k) 
added to the diagram is near its pole. 

For an initial electron line p the change (98.2) amounts to the following change 
in the reaction amplitude: 


u(p)>eV (477)G(p — k)(ye*)u(p) 


= eV (47) PEA (ye*)u(p) 


~ = eV (4a) 7B (ye* up). 


Since (yp)(ye*) = 2pe* — (ye*)(yp) and ypu(p) = mu(p), we obtain the following 
rule: 


u(p) > eV (4) PE wip), (98.3) 


Similarly, for a final electron line p', the replacement of 


\ 
\ 
p’ p’ p' tk 
in the diagram implies the change 
<A <p i DLC) 
i(p') > eV (47) ia(p’) ('k) (98.4) 


in the amplitude. 

In the rest of the diagram we can everywhere neglect the changes in the 
momenta of the lines as a result of the emission of the photon k. Here it is assumed 
that the photon energy w is always small in comparison with the energies of all the 
particles participating in the reaction (and in comparison with those of the hard 
photons, if any, that are emitted). 

Let the cross-section doo refer, say, to the scattering of an electron by a fixed 
nucleus (with possible emission of hard photons). The amplitude of this process, 
which will be conventionally called “‘elastic’’, is 


M §}) = a(p')Mu(p). 
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Making the successive substitutions (98.3) and (98.4) and adding the results, we 
obtain the bremsstrahlung amplitude for emission of the same hard photons 
together with a soft photon k:t 


M; = MiPev(aay(P © = a a (98.5) 


Accordingly, the cross-section is 


(p'e) (pe)|? dk 


set ° 2 Oo 
dao = do: 47e (p'k) (pk)| Qa)20° (98.6) 
Summation over polarizations of the photon k gives 
! z 3 
__,{_ Pp’. __p_ | _dk 
do = é Fras | Ane, doe. (98.7) 
In terms of three-dimensional quantities, this formula becomest 
_ vXn  vXn * dw do, 
do = o( a - | rt dora, (98.8) 


where n=k/q, and v and v’ are the initial and final velocities of the electron. We 
see that the coefficient of do. is in fact the same as the classical intensity of 
emission (cf. Fields (69.4)), divided by w, as already asserted in formula (98.1). 

The condition for the above formulae to be applicable is that not only is w small 
compared with « but also the momentum transfer q to the nucleus is large 
compared with the change 6q in this quantity due to the emission of the soft 
photon. We have 


dq = (p — p—k)— (p'— p)o=0 
= op’—k, 


where |8p’| ~ wd|p'|/de ~ w/v and |k|=. In the non-relativistic case (v <1), we 
therefore obtain the condition 


w/|q\v <1. (98.9) 


For scattering by a Coulomb potential (or by any potential that decreases slowly 
with increasing distance) |q|~ 1/p (where p is the impact parameter), and so this 
condition can also be written as wr <1, where 7 ~ p/v is the characteristic time of 
the collision. 


+t It should be noted that the difference term in this formula arises naturally from gauge invariance: 
the reaction amplitude must be unchanged when the polarization 4-vector e is replaced by e + constant x 
k. 

t To derive (98.8) it is convenient to return to (98.6), putting p = (e, ev), pk = ew(1—-v-n),...,e= 
(0, e), and then summing over polarizations by means of (45.4a). 
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In the ultra-relativistic case, the photons are emitted chiefly in directions near v 
and v’, as is seen from the denominators in (98.8). If the electron scattering angle 6 
is small, the directions of all three vectors p, p’, n are close together. Then 


[5q| = |5p'| — |kI 
“a(t 
v 
~ wm’|/e’, 


and, since |q| ~ <6, we obtain the condition 


2 
@m 
> — 1 
Oia eee (98.10) 


Because the formulae (98.5)-(98.8) are quasi-classical, they are valid for emis- 
sion by any charged particles, not necessarily electrons as assumed in the deriva- 
tion. In general, when several such particles take part in the reaction, formula (98.5) 
must be put in the form 


= Med pe —Be"\ 
M; = M¢ eVidr) d ZF a (98.11) 


where the summation is over all the particles (with charges Ze); formulae (98.6)- 
(98.8) are changed similarly. 
In particular, in the non-relativistic case 


My; a M‘ evn) > Z(v’ = v) -e*, (98. 12) 
For two particles, this formula becomes 


o \mi mM (98.13) 


q=m(v'—y), m = m,m,/(m,+ m)), 


where v and v’ are the relative velocities of the particles before and after the 
collision. From this, on integrating |M,;)|* over the directions of emission of the photon 
and summing over the directions of polarization of the photon, we find the 
nonrelativistic frequency distribution of the radiation: 


2 (ay lls 


dow = do.3=— ° 
3a Mm, M2 @ 


The above results can be generalized to the case of simultaneous emission of 
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several soft photons. For each photon there is an additional factor in M,;;, similar to 
the coefficient of M‘;? (98.5). This is easily seen directly for the example of two 
photons, say. The lines of the two emitted photons have to be added on external 
electron lines, and in two different orders, so that a diagram with external line p is 
replaced by two diagrams with the lines 


k ky 


Ben kK, ey \ Ko \ 
~ YO 
and 
p-k,-k, p-k, p p-k,;-k, Pp-k, Pp 
respectively. They contain the factors 
1 1 1 1 


Po a ee d ee a ee 
2(pki + pkz)2pky “" — 2(pki + pk:) 2pk; 
(the denominators of the electron propagators) respectively, and their sum is 


1 1 
2pk, 2pk,’ 


1.e. it is the product of two independent factors relating to the first and the second 
photon. Then, in the sum of all the diagrams, the terms combine (because of gauge 
invariance) to give the product of differences 


ee ane 
p’ky = pky/\p'k, pk. 


The cross-section for the process separates into factors in accordance with the 
factorization of the amplitude. The soft photons are therefore emitted in- 
dependently. The cross-section for emission of n soft photons can be written 


do = do. dw,... dWn, (98.14) 
where dw, dw2,... are the probabilities of individual emission of the photons k,, 
k,,.... When this formula is integrated over a finite range of values of the variables 


(frequencies and directions), the same for all quanta, a factor 1/n! must be included 
in order to take account of the identity of the photons. 

If the emission cross-section (98.1) is integrated over frequencies in some finite 
range from @ tO 2, the resulting expression is 


do ~ a log(w2/@) doe; (98.15) 


cf. (98.8). Here it is assumed that both frequencies are soft, and the possible values 
of w, are therefore limited by the condition for the method to be applicable. With 
logarithmic accuracy, however, we can put w2~ e, where « is the initial energy of 
the emitting particle. The values of w; have no lower limit, but on letting w;->0 we 
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see that the cross-section for emission of all possible soft quanta is infinite. Let us 
investigate the significance of this “infra-red catastrophe” (F. Bloch and A. 
Nordsieck, 1937). 

When 


a log(e/w;) 2 1, (98.16) 


we have do = do... This, however, means that perturbation theory is inapplicable, 
and do cannot be calculated as a quantity of a higher order of smallness than do... 
Thus in this case the small parameter must be taken as a log(e/w;), not a. 

The derivation of formulae (98.5) and (98.6) from perturbation theory is there- 
fore invalid at sufficiently low frequencies. The classical formula for the intensity 
dI (Fields (69.4)), on the other hand, becomes more nearly correct as w decreases. 
Hence formula (98.1) remains valid if its meaning is made somewhat more classical. 
In this formula it has been assumed that one photon is emitted. Then the energy 
lost by the particle as radiation is equal to w and the “relative energy loss 
cross-section” is w do/e or 


do. dI/e. (98.17) 


In reality, for sufficiently small w, the emission probability is not small, and the 
probability of emission of two or more photons is greater, not less, than the 
probability for one photon. Under these conditions, the expression (98.17) remains 
valid but the classical intensity dI determines, instead of the probability of 
emission of one photon, the mean number of emitted photons 


dn = dl/a, (98.18) 


or, in a finite range of frequencies, 
n= | dI/a. (98.19) 


Since the soft photons are emitted in a statistically independent manner (this 
being true in every approximation of perturbation theory), Poisson’s formula can 
be applied to the process of multiple emission: the probability w(n) that n photons 
are emitted is given in terms of the mean number f by 


w(n)=n"e"/n!. (98.20) 


The cross-section for a process of scattering with emission of photons may be 
written 


do = doa: w(n). (98.21) 


Since = w(n) = 1, do. is the total cross-section for scattering accompanied by the 


§98 Emission of Soft Photons in Collisions 437 


emission of any soft radiation. This is evident from a classical treatment; according 
to perturbation theory, however, do, is the purely elastic scattering cross-section. 
But perturbation theory is inapplicable here. Thus we find that do. calculated by 
perturbation theory as the elastic scattering cross-section actually includes the 
emission of any soft photons. The true value of the purely elastic scattering 
cross-section 1S zero: aS w,;—0, the mean number fi ~~, and according to (98.20) 
the probability of emission of any finite number of photons vanishes.{ 


PROBLEMS# 


PROBLEM 1. Find the spectral distribution of soft quanta emitted in ultra-relativistic electron- 
nucleus bremsstrahlung. 


SOLUTION. Integration of (98.8) over dox gives 


do = aF (€)(dw/w) doa, (1) 
where 
2 
PO == | aarp loee+ E+ 1-1], m 
E= P. sin 30, 


p being the electron momentum and @ the scattering angle. In the ultra-relativistic case, the most 
important range of angles is 


mole? <0<mile: (3) 


the lower limit is given by the condition (98.10), and the upper limit is discussed below. Here 
E =~ £6/2m <1, so that 


F(é) ~ (8/3 mE, 
and the electron—nucleus elastic scattering cross-section is (see (80.10)) 


2 
doa ~4Z2r = de (4) 


The integral 
dow =**Z’ ar? woe 
7) 3 e a Q 


diverges logarithmically; it is cut off below at angles @~ m/c’ and above at €~1, i.e. at angles 
86~ mle. When >“, F ~ (4/7) log € and the integral converges. Thus we have, with logarithmic 
accuracy, 


2 
do, = Zar? do log —, (5) 
@W @ 


which agrees with the logarithmic part of formula (93.17) (where we must put ce ~ ce’). Non-logarithmic 
accuracy can be achieved only by going beyond the quasi-classical range. 


+ We shall return to a more detailed discussion of this in §130, in connection with radiative 


corrections. 
+ The following applications of formula (98.7) are due to V. N. Baier and V. M. Galitskil (1964). 


438 Interaction of Electrons with Photons §99 


PROBLEM 2. For a collision between two ultra-relativistic electrons, determine (in the centre-of- 
mass system) the cross-section for simultaneous emission of two soft photons in opposite directions at 
small angles to the electron momenta. 


SOLUTION. Photons moving in opposite directions are emitted by different electrons, each in the 
direction of its motion. The cross-section for simultaneous emission is 


dw, dw. 

do = doa: aF (é) aF(é) ae (6) 
€ =(e/m) sin 28, 

where ¢ is the energy of each electron, 6 the scattering angle in the centre-of-mass system; @ is the 
same for each electron. No factor 7 is needed in the cross-section, since the photons are certainly 
emitted in different directions. The cross-section for elastic scattering of the electrons through small 
angles in the centre-of-mass system, in the ultra-relativistic case, is the same as (4); cf. (81.11). Unlike 
(1), the cross-section (6) behaves as 6d@ when 6-0, and the integral therefore converges. On the one 
hand, this enables us to extend the integration to @=0, without any difficulty that the method might 
cease to be applicable. On the other hand, the main contribution to the integrated cross-section now 
comes from the region 6 ~ m/e, not 6 <m/e, and so the exact expression (2) has to be used. The result 
of integrating the cross-section over scattering angles is 


2 dw dwr 
@1 W2 


dow,0) = 2 [5 +3203) ra 


2 2 dw; da 
a 5.9r-a names 
@1 W@W? 


the value of the Riemann zeta function being ((3) = 1.202. 


$99. The method of equivalent photons 


Let us compare two processes described by the diagrams 


(99.1) 
(a) | k (b) 
va. p 


where the circles represent the whole of the internal parts of the diagrams. Diagram 
(a) represents a collision between a photon k (k*=0) and a particle having 4- 
momentum q (and mass m; q’?=™m_’). The system resulting from the collision is a 
particle or group of particles having total 4-momentum Q. Diagram (b) represents a 
collision between the same particle q and another particle having 4-momentum p 
and mass M (p’=M?’). After the collision, the latter particle has 4-momentum p’, 
and the same system Q is formed. The second process may be regarded as a 
collision between the particle q and a virtual photon emitted by the particle p and 
having momentum k = p—p’ (k’<0). If |k|? is small, the virtual photon is not 
greatly different from a real photon. Such a situation is evidently possible in 
collisions of very fast particles: the electromagnetic field of a charged particle 
moving with v ~1 is almost transverse, and therefore has properties similar to 


§99 The Method of Equivalent Photons 439 


those of the field of a light wave. Under these conditions, the cross-section for 
process (b) can be expressed in terms of that for process (a).f 

We shall thus suppose that the particle M is ultra-relativistic, with energy (in 
the rest frame of the particle m) ¢ > M. If the masses of the colliding particles m 
and M are different, we shall take the case where m < M. 

The amplitude of process (a), which involves a real photon, can be written 


MiP =— eV (4r)(e,J"), (99.2) 


where e, is the photon polarization 4-vector and J* the transition current: cor- 
responding to the vertex (the circle) in the diagram. The amplitude of process (b) is 
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Mj = Ze ma GJ”), (99.3) 


where j, is the transition current of the particle m (the lower vertex in the diagram), 
and Ze is the charge on this particle. The current J is a function of k = Q —q, and 
is therefore not the same in the two cases, since k* = 0 in (99.2) and k?# 0 in (99.3). 
But if, in the second case, 


k>| <m?, (99.4) 


we can here also take J for k* = 0. 

The change in the momentum of the particle M when a virtual photon is 
emitted, p— p’ =k, is small in comparison with its initial momentum |p| ~ €; we can 
therefore put p = p’ in the transition current j. That is, the motion of the particle M 
may be regarded as uniform motion in a straight line. Since such a motion is 
quasi-classical, the corresponding current is independent of the spin of the par- 
ticle: 


j* =2p*. (99.5) 


The condition for the current to be transverse (jk = 0) now gives ew — p,k, = 0, 
the x-axis being taken in the direction of p. Hence 


w = vk,, (99.6) 

where v = p,/e is the velocity of the particle M. Since 
—~k?=-@’+k2+ki =o*(1-v’)+ki, (99.7) 
where k, is the component of the vector k transverse to the x-axis, the condition 


+ The method given below is due to C. F. von Weizsacker and E. J. Williams (1934); the basic idea 
had been stated earlier by E. Fermi (1924). 

+ When the wave functions are normalized to one particle in unit volume, the current j* = (1, v), 
where v is the velocity. We have, however, decided (§64) to omit the normalization factor 1/V(2e) in the 
wave functions. Accordingly, j* must include a further factor 2c, and this gives the expression (99.5). 
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(99.4) is equivalent to the inequality |k,|<m and to a considerably weaker 
inequality for wo: o <m/V(1— 0”). 

From the condition for the current J to be transverse (Jk =0) we have, using 
(99.6), 


J ,oitk KL 
as 60) 


Jo== 


We therefore obtain for the scalar product jJ 


JJ = 2(Joe — Jxpx) 


2 
~2= (ki -Ju+ oe Jn). (99.8) 
@®) 

The product Je in (99.2) can be expanded by taking the polarization 4-vector of 
the real photon in the three-dimensionally transverse gauge: ek =—e-k=0, 
whence e, ~ —e, °kK,/w. Then 

Je = e-(I.-+ aT e ) (99.9) 


The expressions (99.8) and (99.9) are proportional if the second terms in the 
parentheses are negligible. Since the current J pertains to the upper vertex of the 
diagram (99.1b), it does not depend on the direction of p; hence J, and J, must be 
taken to be quantities of the same order. For the terms in question to be negligible, 
therefore, we must have |k,|<w and w <e’|k,|/M’; these conditions are com- 
patible with the previous ones on k, and w. 

Assuming that in (99.9) the photon is polarized in the plane of x and k (so that 
e,||k.) and noting that the conditions stated imply that e? ~ e’ = 1, we now have 


) ZeV (471) 2e 
a rs 


M; = M¢ Ik, |. (99.10) 


In accordance with the previous discussion, the following conditions are here 
assumed satisfied: 


|k,|<w <my, (99.11) 
wy’ <|k,| <m, (99.12) 
with the notation 
y =e/M = 1/V(1-0”). 


From this we can find the relation between the corresponding cross-sections. 
According to the general formula (64.18) we have (in the rest frame of the 
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particle m) 


: l 
do, = |M $?|’(27r)*6(P; — Pi) Amo oPe 


d’p' 


do = |M;;|"(277)*8 OCP; — Pi) 2e(27) dpg, 


or 


where dpg represents the statistical weights of the particles Q. Using (99.10) and 
(99.7), we find 


do = do, - n(k)d’p', (99.13) 
where 
Zire" ki 
n(k) = ae? woke topyy (99.14) 


Here do, is the cross-section for process (a), resulting from a collision between a 
real photon and a particle at rest, in which a system of particles Q is formed which 
have momenta in certain ranges; do refers to the process (b) of formation of the 
same system Q when a fast particle (of mass M) collides with the same particle at 
rest, loses momentum p — p’ = k, and remains in the range d’p’ of values of p’. The 
factor n(k) in (99.13) may be interpreted as the number density (in k-space) of the 
photons equivalent to the electromagnetic field of the fast particle. 

The integration over d’p' is equivalent to one over d’k = dw d’k,. On integrat- 
ing over d’k,, we obtain the cross-section for a process in which the total energy E 
of the system of particles Q lies in a given range dE =dw (E-—m=e-&'=a, 
where « and «’ are the initial and final energies of the particle M). Integration over 
the directions of k, signifies averaging over the directions of polarization of the 
incident photon (and multiplying by 277). The result is 


do = n(w) do, dw, (99.15) 


where 


n(w) = n(k)27k, dk, 


= Ze k3. eS 
(k2 + w/y’P ly*) 


The integral over dk, diverges when k, is large, but the divergence is only 
logarithmic. This enables us (within the range of validity of the method) to obtain a 
result in the logarithmic approximation: it is assumed that not only the argument of 
the logarithm but the logarithm itself is large. To this accuracy, it is sufficient to 
take as the upper limit of integration kimax~ m, the upper limit of the inequality 
(99.12). Integration then gives for the frequency distribution of equivalent photons 
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_2 ymc* dw 
n(w) dw = = Za log i 


(99.16) 


The approximation used here signifies that the numerical coefficient in the 
argument of the logarithm remains indeterminate. The inclusion of such a 
coefficient would mean the addition of a relatively small quantity (~1) to the large 
logarithm and would be superfluous having regard to the accuracy of the method. 


PROBLEMS 
PROBLEM 1. From the photon—electron scattering cross-section, find the bremsstrahlung cross- 
section in a collision between a fast electron and a nucleus. 


SOLUTION. In the frame of reference K, in which the electron is at rest before the collision, the 
process may be regarded as the scattering by the electron of the equivalent photons of the field of the 
nucleus.f According to (86.10) the cross-section for scattering of a photon by an electron in the frame 
K, is 


! ! 2 \ 
deren, ty = ar2 APH [4 Ey (MM) aq (L—2)) (1 
1 


wi ow: \@ @ Wi Wi 


where wi and w} are the initial and final energies of the photon in this frame. The bremsstrahlung 
cross-section in the frame K;, is 


dop(wi) = | dw; - n(a1) dosc(wi, 1), (2) 


where n(qw;) is the function (99.16). Since the cross-section is invariant, the change to a frame K in 
which the nucleus is at rest involves only a change in the frequency w{. The frequencies w} and w’ in the 
frames K, and K are related by the Doppler formula 

w'=yoi(1-vcos 6), y=1/V(-0v, (3) 


where 6 is the scattering angle in the frame Ki. The same angle relates w{ and w; according to (86.8): 


ae ee ee 
oa oe (1 —cos 61). (4) 
From (3) and (4) we have 
wi = wie'le, (5) 


where « (=my) and e’ are the initial and final energies of the electron in the frame K (e — €'= w’). 
Substituting (5) in (1), we find 


2 
dom write (84 £4 a” 2a 
sc é€ EW] ’ ° 


joe ! 
E E E W@W] WIE 


This expression is to be substituted in (2) and the integration over dw; carried out with w’ (i.e. ¢') fixed, 
the range being from @1,min= mw'/2€' tO wi,max = 2ew'/m; these values are given by (3) and (4) with 
6; =0 and 6; = 7. Because the integral converges rapidly for large w, the main contribution to it comes 


+ The scattering of virtual photons by the nucleus (in the rest frame of the nucleus) is excluded by 
the large mass of the latter: the scattering cross-section tends to zero with increasing mass of the 
scattering particle. 
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from the range of w: near the lower limit, i.e. we may put @1,max—>®. Calculating the integral with 
logarithmic accuracyt, we have 


aa € 2 ee! 
r e yuna ae 2 cee Peasy 
dow: = Ar? al — 3 ‘(S45 —§) lo a 
For this result to be valid, besides the condition ¢« >m (ultra-relativistic electron), the condition 
(99.11) must also be satisfied: the frequencies w) ~ 1, min important in the integration must be «e. Hence 
e —¢e' =o’ <ee'/m. Under these conditions the result agrees (to logarithmic accuracy) with (93.17), as it 
should. 


PROBLEM 2. The same as Problem 1, but for electron—electron bremsstrahlung. 


SOLUTION. In this case, the virtual photon can be scattered either by the fast electron or by the 
recoil electron; the photons equivalent to the field of either electron are scattered by the other. The 
scattering of virtual photons by the fast electron gives the cross-section dot, which is equal to the 
cross-section for an electron and a nucleus with Z = 1. 

The scattering of virtual photons by the recoil electron gives a cross-section 


oa? a - | dw: n(@) dos(w, w i; 


with do;-(w, w') given by (1) with the appropriate change of notation for the frequencies. The range of 
values of w for given w’ is (cf. (4)) 


1 
wo =w so for w' >3m, 


wo <o<w'/(m—20') for w'< mM. 


When w' <im, integration with respect to w gives 


dw' w' ow” 
do = tari dH (1-24 8%) og, 
a of Int m7) 8 
in agreement with (97.4). But when w' >3m we must distinguish the cases w’ ~ m and w’ ~ e > m. In the 
former case, 


m da’ m m E 
da? = 3 a 4— m+ Fea) lo roe 
br 3Qle ' 4 ' g 1’ 


in agreement with (97.3); in the argument of the logarithm we have, with sufficient accuracy, replaced 
e/w' by e/m. In the case w' ~ e, the method of equivalent photons is not valid for calculating dot?. The 
frequency w of the virtual photons takes values beginning with w', and the condition (99.11) is therefore 
not satisfied when w = w'~ e. 


PROBLEM 3. Determine the total pair production cross-section in a photon—nucleus collision from 
the pair production cross-section in a collision between two photons. 


SOLUTION. The energy of the photon in the rest frame K of the nucleus is w > m. If we change to a 
frame Ko in which the nucleus moves to meet the photon at a speed vo such that 1/V(1 — v6) = >w/m, then 
in this frame the photon energy is 


_ Loo: a es 
oo = @ Tq gp TOV vo)=m 


The required cross-section o is calculated in the frame Ko as the pair production cross-section in 
collisions between an incident photon wo and the equivalent photons of the nucleus, whose energy we 
denote by w’: 


o= i Oyyn(w’) dw’, 


{+ This means that, by one integration by parts, the term containing the large logarithm is separated 
and the remaining terms then neglected. This operation is equivalent to taking the logarithm log(e/w1) 
outside the integral, with w) = 1, min. 
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where o,, is the cross-section for pair production by two photons and is given by §§88, Problem, 
formula (1), with 


v= V(1—m’/wow’) = V(1— mio’). 


Changing to the variable v instead of w', we have 


1 


o = 2reaZ | v loglw(1- vm} — v*) log 


0 


Peo. ogee 
i—% 2v0(2 o°)} de. 


Because of the convergence at the upper limit, the integral may be taken over the whole range from 
the reaction threshold w'=m (v=0) to w'=& (v=1) and with logarithmic accuracy (replacing 
log[w(1— v’)/m] by its value for v = 0 and taking it outside the integral). The result is 

o = $aZ’r? log(w/m), 


in agreement with (94.6); this formula is valid when log(w/m) > 1. 


§100. Pair production in collisions between particles 


Electron—positron pair production in a collision between two charged particles 
is described by diagrams of two types: 


ae ra oe 
| 
! | | (100.1) 
une) Care RENE CE ae aN os cg 
p_ -p, Pp.  -p, 


(a) (b) 


The two upper continuous lines in each diagram correspond to the colliding 
particles, and the lowest line to the pair formed. 

Let us consider a collision of two heavy particles (nuclei) in the ultra-relativistic 
case. The change of the motion of the particles themselves in such a collision may 
be neglected, i.e. they may be regarded as external-field sources.t This corresponds 
to two diagrams of the first type: 


| 
(100.2) 


where q°”, q® are the “momenta” of the Fourier components of the fields of the 
two particles. 


The potential A" =(Ao, A) due to a classical particle moving with a uniform 


+t The collision of two light particles (electrons), where the change in the motion cannot be 
neglected, is a considerably more complicated case; see the book by Baier et al. cited in §93. 
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velocity v satisfies the equations 
[JAg = —40Zed(r — vt — ro}. 
LIA = —4mZevi(r — vt — ro). 


its Fourier components are 


8a*Ze 


2 5 
w —k 


Aol, k) — = 


e* §(m —k-v), 


and similarly for A(@, k). In four-dimensional form, 


_ 82’*Ze 
aso ae 


A"(q) = e'*U"8(Uq), 


where U is the 4-velocity of the particle, and the 4-vector x, = (0, ro). If nucleus | is 
at rest at the origin (r§’ = 0), then p=rJ is the impact parameter vector (in a plane 
perpendicular to the direction of motion of nucleus 2). This expression for A“(q) ts 
to be used in writing analytically the diagrams (100.2). 

There is, however, no need to use this method for the actual calculations in the 
present case. The pair production cross-section may be determined by the method 
of equivalent photons, using the already known photon—nucleus pair production 
cross-section. The replacement of the field of one particle (the first, say) by a 
spectrum of equivalent photons implies that in the diagrams (100.2) the lines q‘” are 
regarded as real-photon lines. The two diagrams then become identical with the 
diagrams corresponding to pair production by a photon at nucleus 2. When «4, 
e_>m, the cross-section for the latter process is given by (94.5). Multiplying this 
cross-section by the spectrum (99.16) of equivalent photons of the first nucleus, we 
obtain (with logarithmic accuracy) for the differential cross-section for pair 
production in a collision between particles 


_8 » de,de_ 4 Doniuad E+ E_ my 
do = a r(Z,Z2a) es (e, + €2 + 3e,e_) log Ge.) log es 
(100.3) 
where y = 1/V(1— v’) > 1. 
Here it is assumed that 
mM <e&4,6.-<my; (100.4) 


the right-hand inequality is the condition for the method of equivalent photons to 
be applicable. The range defined by the inequalities (100.4) is the same as the 
electron and positron energy range which is important in the integration of (100.3). 
On integration over de, or de_ for a given sum ¢ =e,+ &_ (>m), the important 
range is the one near the upper limit; omitting terms which do not contain the large 


QE4- DD 
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logarithm, we find 


36 


6 2 2 op © log My dé 
on r(ZiZ.a) log =a log : 


E—€ € 


do = 


The integral with respect to « over the range (100.4) diverges as the cube of the 
logarithm, but only as the square of the logarithm at the boundaries of the range. In 
the logarithmic approximation (log y > 1), therefore, the range (100.4) is in fact the 
most important one, and the integral can be taken over the range from m to my. 
Since 


Y 
d 
| log (log y — log &) S = log’ y, 
1 


the total pair production cross-section is 


“33 MAO 24.3 1 
c= 77 r&(Z,Z2.a) log Va-0) (100.5) 


(L. D. Landau and E. M. Lifshitz, 1934). 

Let us now consider the case of non-relativistic velocities of the colliding 
nuclei. The change in their motion due to their interaction then becomes important, 
and the main contribution to the pair production cross-section comes from 
diagrams of the second type in (100.1). There are four such diagrams: two of them 
are 


aa an Sena 7 DS ee 
| | 
| | 
>, Sa ee ee P, —~1—_.-—__,_--— P, (100.6) 
ly ,k 


and the other two are similar except that the virtual photon k (which produces the 
pair) is emitted by the first nucleus and not by the second.t 

We shall suppose that the energy of the pair is small compared with the kinetic 
energy of the relative motion of the nuclei in their centre-of-mass system: 


€,+¢e_<4Mv’, (100.7) 


where v is the initial relative velocity and M = M,M,/(M, + M2) is the reduced mass 
of the nuclei. Then the reciprocal effect of pair production on the motion of the 
nuclei can be neglected. If the electron—positron line in the diagrams (100.6) is 
omitted, the remainder will represent the emission by the colliding particles of a 


t+ Altogether 36 diagrams correspond to pair production in a collision between two electrons: 
2! 3!=12 diagrams of type a, differing by interchanges of the two initial and three final electrons, and 
2x 2!x 3!=24 diagrams of type b, obtained in a similar way from the two diagrams (100.6). 
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low-frequency virtual photon (@ = e«,+e«_). Thus we return to the situation dis- 
cussed in §98 for the emission of a real soft photon, and can use the formula (98.13) 
derived there for the non-relativistic case (except that the amplitude V(4m)e* of 
the real photon will be replaced by the virtual photon propagator).t Thus the 
amplitude of the whole process of pair production becomes 


a 1 /(Z Z ba caes 
My = MP (Se Sq" Du lol-ie@_y"u.)), (100.8) 


60) 


where q = (0, q), q= M(v' —v). 
As usual, the photon propagator in the non-relativistic case is to be taken in the 
gauge (76.14). From the amplitude (100.8) we find the cross-section for the process: 


a ee Z._ Z2\'_d psd ps, ne ) 
do doe é (a M> Qe, ; 2¢_(27r)°w*(@? — k’)’ (47) li_y Qu. - 
(100.9) 
where 
1 
w=ete, k=ptp, Q=q-—3k(q-k); 


do, is the cross-section for elastic scattering of one nucleus by the other, in their 
centre-of-mass system, and is given by Rutherford’s formula: 


doe ad 4(Z,Z,e*)M’ dolq’ 
~ 4(Z Ze?) — (100.10) 


the last equation assumes that the deviation of the nuclei from their original 
direction of motion (the x-axis) is small. Substituting this expression in (100.9) and 
summing over polarizations of the pair in the usual manner, we obtain 


4 Z 
bake ~\2 € Zi at =) 
do = (Z,Z2e ) De (7. ive x 
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4tr*e.e_q 


The remaining calculation is made in the approximation in which all the 


+ In the non-relativistic case, the photon momentum is small in comparison with the change in 
momentum of the radiating particles (|5p| ~ w/v), and can therefore be neglected, in comparison with 5p, 
even when the photon energy is not neglected. This applies a fortiori here to the virtual photon, for 
which the four-dimensional square k? = (p+ p-)’ > 0, so that |k| < w. Under these conditions there is no 
difference between real and virtual photons, and the use of formula (98.13) is thereby justified. 

+ The diagrams (100.6) are shown on the assumption of the Born approximation for scattering of 
nuclei. But, since Rutherford’s formula is exact (for Coulomb interaction), the validity of the results 
obtained does not in fact depend on the fulfilment of the condition for the Born approximation to be 
valid. 
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logarithms occurring in the integration are assumed large. We shall see that, to this 
accuracy, pair energies ¢,, «_>m and angles 6 between p, and p_ such that 


mile <@<1 (100.12) 


are the most important. With the appropriate approximations, the calculation of the 
trace in (100.11) gives 


tr{.. .}= 4[(e.e. — pi p(a- 10) + 


sae 


+ 2p. -a)(p-- q) + =F= (q: “EK (e,g-p + ea: ps], 


where we can also put |p.| = e4, |p_| = e_. In the denominator, 


> (E44 e.)° 
ae ar 


w?—k’~ e,e_6? +m 
Integration over the directions of p. and p-_, for a given angle between them, gives 


8 4 
do = 35 (ZiZ,¢° 5 (2-2) (c +67) de, de_xX 


3 
“Te + aXe. rey ‘ =e (100.13) 
The form of the dependence on @ confirms the hypothesis (100.12), and 
integration with respect to 6 gives logle,e_/m(e,+ e_)]. Integration of the last 
factor in (100.13) is from q, = q; =0 to V(q? + q2) ~ 1/R, where R is a quantity of 
the order of the radius of the nuclei (corresponding to the smallest impact 
parameters; see below). This integration gives 


[mw log(qy + qy + q2)1R=e=0* ~ 2 log Ra: 
The total energy of the pair, equal to the change in the energy of the nuclei, is 


=(e,+ e_)= 1M (v” Vv) Mov(vx— vx) = 0q;, 


whence q, = e/v. Thus we find 
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and, after integration over de, or de_ with a given sum ge, 


4,2 2 

do = 5~ (Zidre¥ Se (Ft) log Flog = =. (100.14) 

The energy « may be correlated with the impact parameter p ~ v/e; the pair 
energy is of the order of the frequency which corresponds to the collision time. 
Hence the logarithmic divergence on integration over de in (100.14) implies a 
similar divergence with respect to impact parameters. This means that large values 
of p are important (and this, incidentally, justifies the use of the cross-section 
(100.10) for scattering in the purely Coulomb field of the nucleus). Accordingly, the 
important range of energy is given by m <e <v/R. Integration of (100.14) gives the 
total pair production cross-section; the final result is (in ordinary units) 


_ 16 2 (£) (2-4) 3_hw 
a (Z,2Z.a) 7% ; M, M, log mc2R (100.15) 


(E. M. Lifshitz, 1935). 


§101. Emission of a photon by an electron in the field of a 
strong electromagnetic wave 


The application of perturbation theory to processes of interaction between an 
electron and a radiation field requires not only that the interaction constant a 
should be small but also that the field should be sufficiently weak. If a is the 
amplitude of the classical 4-potential of an electromagnetic wave field, the charac- 
teristic quantity in this respect is the dimensionless invariant ratio 


€ = eV(—a’)/m. (101.1) 


In this section we shall consider emission processes occurring in the interaction 
of an electron with a field of a strong electromagnetic wave, for which € can have 
any value. The method used is based on an exact treatment of this interaction; the 
interaction of the electron with the newly emitted photons can, as before, be 
regarded as a small perturbation (A. I. Nikishov and V. I. Ritus, 1964). 

Let us consider a monochromatic plane wave, say a circularly polarized one. Its 
4-potential may be written in the form 


A=a,cos ¢ + a2 sin d, @ = kx, (101.2) 


where k" = (w,k) is the wave 4-vector (k? = 0), and the 4-amplitudes a, and a» are 
equal in magnitude and orthogonal: 


=a. a,ar= 0. 


~ A numerical error was corrected by L. B. Okun’ (1953). 
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We shall assume that the Lorentz gauge condition is applied to the potential, so 
that a,;k = ak =0. 

The exact wave function for an electron in the field of an arbitrary plane 
electromagnetic wave has been derived in 840, and is given by formulae (40.7) and 
(40.8). We shall, however, change the normalization by making %, correspond to 
unit mean spatial number density of particles, in the same way as the wave 
functions of free particles are normalized to “one particle in unit volume’’. Since 
the mean density for the function (40.7) is jo = qo/po, in order to obtain the required 
normalization this function must be multiplied by V(po/qpo), ie. the factor 1/V/(2po) 
in (40.7) must be replaced by 1//(2q)). For a wave with the 4-potential (101.2), we 
find 


by = [1+ 5cesy (Crk) cos 6 + (okDC yas) sin 9] HP x 
x expy ~ i = ee sin d + ie qe cos f — igx |, (101.3) 


where 


2 


H = pH — 92" __ a 101.4 


According to (40.14), the 4-vector q is the mean 4-momentum of the electron; we 
shall call it the quasi-momentum. 

The S-matrix element for a transition of the electron from the state yw, to the 
state ys, with emission of a photon having 4-momentum k” = (w’, k’) and polariza- 
tion 4-vector e’ is 


ik’x 
.f - , € 
— le | Wy (ye’™) Wp V (0) d*x. (101.5) 
The integrand in (101.5) is a linear combination of the quantities 


—ia, sin d+ia cos 
€ , 


cos d : ge HERON 
sin : le 
where 
_ ,(&P aip' _ (@2p _ arp’ 
ie = er). ge (Te al (101.6) 


These quantities, together with the factor exp[i(k'+p’—p)x], give the whole 
dependence of the integrand on x. We expand them in Fourier series, denoting the 
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expansion coefficients by B,, B,,, Bx, respectively; for example, 


e —ia, sin tia cos @ _ e —iV (a}+ a4) sin(@—dg) 


= DB en, 


sS=—® 
These coefficients can be expressed in terms of Bessel functions by the formulae 


B,= J;(Z) eres, 
By, = [Je4i(z) e879 + J,_1(z) ef D9], (101.7) 


Ba, = 5- [Ue ss(z) e040 — J, 4(z) °°, 


where 
z=V(aitas),  cosdy=aj/z, sin do = ap/z. 
The functions B,, B,,, B2, are related by 
a, B,, + a2B 2, = sB,, (101.8) 
which follows from the familiar relation 
J5-1(Z) + Js 41(Z) = 28J;(z)/z 


between the Bessel functions. 
The matrix element (101.5) then becomes 


eee 1 S) 4ea(4) ae eee 
Si Qa’ 2qo> 240) 2 Mi (27r)*i8 (sk + q — q’—k’); (101.9) 


we shall not give here the fairly complicated expressions for the amplitudes M$. 
Thus S;; is an infinite sum of terms, each corresponding to a conservation law 


sk+q=q't+k’. (101.10) 
Since 
q@=q?=mM1+2)=m2 (101.11) 


(cf. (40.15)), and k? = k” = 0, the equation (101.10) can be satisfied only if s = 1. The 
sth term of the sum describes the emission of a photon k’ by the absorption from 
the wave of s photons with 4-momenta k. The form (101.10) shows that all the 
kinematic relationships which occur for the Compton effect will apply to the proces- 
ses considered here if the electron momenta are replaced by the quasi-momenta q 
and the incident photon momentum by the 4-vector sk. In particular, the frequency 
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of the emitted photon in the frame of reference where the electron is at rest on 
average (q=0, qo = mx) 18 


_ Sw 
ach (sw/m,.)(1 — cos 6)’ (101.12) 


where @ is the angle between k and k’; cf. (86.8). We may say that the frequencies 
w' are harmonics of w. 

In the notation previously used ($64), the amplitude of the process of emission 
of the sth harmonic is M}, and the expression 


d°k' d?q' 


——---- 40(4) eee 
(2ar)° + 2w' -2q0-2qh (27)'6"(sk + q — q'—k’) (101.13) 


dW, =|M¥?/’ 


gives the corresponding differential probability per unit volume and unit time.t 

The amplitudes M'!) have a structure similar to that of the scattering amplitudes 
with plane waves, l(p’)...u(p); the operations of summation over polarizations of 
the particles are therefore carried out in the usual manner. After summation over 
the polarizations of the final electrons and the photon and averaging over the 
polarizations of the initial electron, we have 


_ em? d?k’ d3q 


(4) ee eee 
dW, a aay 6"(sk +q—q'—k')x 
kk’) 5 
1 Dye) * (1 + see s) pata ~ 253), 101.14 


In order to integrate this expression, we note that, owing to the axial symmetry 
of the field of a circularly polarized wave, the differential probability is independent 
of the azimuthal angle @ around the direction of k. This fact, together with the 
presence of the delta function, enables us to integrate over all variables except one, 
which we take to be the invariant u = (kk')/(kp’). Then, after integration over 
d°k db dfai+ w'}, we find 


d’q'd*k' 22mdu 

(4) ee es ee 

6°(sk + q—q'—k’') qh’ eye 

For, in the centre-of-mass system (in which sk+q=q+k' =O), this integration 
gives 27|q'ld cos 0/E,, where E, = sw + qo= w'+ qo and @ is the angle between k 
and q’; cf. the transformation (64.12). In the same system, moreover, 


ee: eee ios 
qi—|q'|cos@ lq + uy” 


+ It should be noted that the normalization of the functions y to unit density corresponds to 
normalization by the delta function “on the q/27 scale’’; cf. (40.17), where the factor qo/po on the right 
will now be absent. It is for this reason that the number of final states of the electron must be measured 
by the element d*q’/(27)’. 
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The range —1<cos 6 <1 corresponds to 


th 2 2 


= 2s(kp)/m;; 


in making the transformations it must be remembered that kp = kq. 
Thus the total probability of emission from unit volume in unit time is 


Wed wi at - 492) + (243 Monta 1-29}, 
(101.15) 
wheret 
u = (kk')/(kp’), u, = 2s(kp)/m, 
(101.16) 


= 29m; Vi (1~) | 


When & <1 (the condition for perturbation theory to be valid), the integrand in 
(101.15) can be expanded in powers of & For example, the first term in the 
expansion of W; is 


_e'm’ ,, 4 8 1, 8 1 
= #| (1 i i)log(t tui) +547 ere A (101.17) 


with u, ~ 2(kp)/m*. This result agrees, as it should, with the Klein—-Nishina formula 
for the scattering of a photon by an electron: putting in (101.17) —a’=4z/a, 
E° = 47re’/m’w, and dividing by the incident flux density (64.14), we return to (86.16) 
(the integrated scattering cross-section is independent of the initial polarization of 
the photon).¢ 

The expression for the probability of emission of the second harmonic (the first 
term in the expansion of W, for é <1) is 


+ To calculate z, we first note that 
z’ = (aiQ)’ + (a2Q)’ = a’Q’, 


where Q = q/(kq) — q'/(kq'). This is easily shown by choosing a frame of reference in which (ai)o =, 
(a2)o = 0 and the vectors ai, a, k are along the axes x', x’, x’, and noting that Qo = Qs; because kQ = 0. 

t This value of a* corresponds to normalization of the 4- potential to “one particle in unit volume”’. 
To determine it, w must be equated to the energy of a classical field with the (real) 4-potential (101.2). 
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a) 
_ e'm’é* | duu u u’ u u 
ee aera Al S 


2°3u, ue us 2(1+2u,) 


I 3 3 1 


The leading term in W, for fairly small s is proportional to é**. 

Let us now consider the opposite case (> 1). The parameter & can be made 
large, for instance, by decreasing the frequency w with a fixed field strength; 
evidently € = eF/ma, where F is the amplitude of the field strength. It is therefore 
clear that the case € > 1 essentially refers to processes in a constant and uniform 
field where E and H are orthogonal and equal in magnitude; this will be called a 
crossed field. The probability of emission in this field can be found by taking the 
limit € > ©, but it is simpler to assume a constant field in the calculations, taking the 
4-potential in the form 


A" =a"d, db = kx, ak =0 (101.19) 


(so that F,, = k,a, — k,a, = constant). The exact wave function of the electron in 
this field is obtained by substituting (101.19) in (40.7), (40.8): 


_ (yk)(ya) , ] _u(p) _+, (ap) po, -0 @ a3, 
Wp Ji+e (kp) verntaea le (kp) ° + ie 6(kp) ° ipx (101.20) 


The result given by using this function is exact for emission by an electron with 
any energy in a crossed field. However, in the ultra-relativistic case this result 
(when put in the appropriate form; see below) applies to emission by an electron 
not only in a crossed field but in any constant and uniform electromagnetic field, 
including a constant magnetic field as discussed in $90. 

To formulate this assertion we note that the state of a particle in any constant 
and uniform field is defined by as many quantum numbers as the state of a free 
particle, and these may always be so chosen as to become, when the field is 
removed, those of a free particle, i.e. its 4-momentum p” (p’?= m’). Thus the state 
of a particle in a constant field is described by a constant 4-vector p. 

The total intensity of emission, being an invariant, depends only on the 
invariants which can be constructed from the constant 4-tensor F,, and the 
constant 4-vector p”. Since F,,, can appear in the intensity only in combination with 
the charge e, we obtain three dimensionless invariants: 


2 e? v\2 _ e? a 2 
x? =~, Fup’) =~ 5 (kp. 


f =e(F,,y/m*, (101.21) 


2 
e 
B= a eau ad ae 
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In a crossed field f = g =0, whereas in general all three invariants are non-zero. If 
the electron 1s ultra-relativistic (pp) >m), however, and the vector p makes angles 
6 > m/po with the fields E and H, then y*>f, g (that is, for an ultra-relativistic 
particle any constant field appears to be a crossed field for almost all directions p). 
If also the fields |E|, |H|<m7/e (= m/’c*/eh), then |f|, |g|«<1.+ Under these con- 
ditions the intensity calculated for a crossed field and expressed in terms of the 
invariant x will apply also to the emission in any constant field. 
The invariant x is given in terms of the fields E and H by 


2 
x= {(p X H+ poE)’ — (p- BY}. 


For a constant magnetic field, y is equal to the quantity (90.3), and the above 
arguments are therefore another means of deriving the results in 890.4 


+ And p in y may be regarded, with the same accuracy, as being the ordinary 4-momentum of the 
particle. 

+t A detailed account of the theory of various processes in strong fields is given in the review papers 
by A. I. Nikishov and V. I. Ritus in Proceedings (Trudy) of the P. N. Lebedev Physics Institute, Vol. 
111, pp. 5 and 152. 


CHAPTER XI 


EXACT PROPAGATORS AND VERTEX PARTS 


§ 102. Field operators in the Heisenberg representation 


HITHERTO, in considering various specific processes in electrodynamics, we have 
used only the first non-vanishing approximation of perturbation theory. We shall 
now go on to discuss the effects which occur in higher approximations. These are 
called radiative corrections. 

A better understanding of the structure of the higher approximations can be 
obtained by first examining some general properties of exact scattering amplitudes 
(i.e. those which have not been expanded in powers of e’). We have seen in §72 
that the successive terms of the series in perturbation theory can be expressed 
in terms of the field operators in the interaction representation, whose time depen- 
dence is determined by the Hamiltonian Hy of a system of free particles. The exact 
scattering amplitudes, however, are more conveniently expressed in terms of the 
field operators in the Heisenberg representation, where the time dependence is 
determined by the exact Hamiltonian H =H)+V of a system of interacting 
particles. 

The general rule for constructing the Heisenberg operators gives 


h(x) = b(t,r) = ee ™, (102.1) 
and similarly for h(x) and A(x), w(r), etc., being time-independent (Schrodinger) 
operators.f It may be noted immediately that the Heisenberg operators for a given 


time obey the same commutation rules as the operators in the Schrédinger 
representation or the interaction representation: for example, 


{hi(t, rbe(t, ey}, = edi (nr), ile ce" = y% 8(r- '); (102.2) 
cf. (75.6). Similarly, the operators s(t, r) and A(t,r’) commute: 
{Whi(t, r), A(t, r)} = 0, 


but this does not hold good for operators pertaining to different times. 
The “equation of motion” satisfied by the Heisenberg wW-operator can be 
derived from the general formula QM, (13.7): 


i ee) = HG) 4. (102.3) 


+ In this chapter, operators with a time argument belong to the Heisenberg representation; those in 
the interaction representation will be given the suffix int. 
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The Schroddinger and Heisenberg representations are the same as regards the 
Hamiltonian, which is expressed in the same way in terms of the field operators. 
Here, to calculate the right-hand side of (102.3), we may omit from the Hamiltonian 
the part which depends only on the operator A(x) (the Hamiltonian of the free 
electromagnetic field), since this part commutes with s(x). According to (21.13) and 
(43.3), 


H =| dt. Na p+ BmybGn) dx +e [ GG. 5OAG mrbe.n ds 
= | HU, yp +m + eA iD) ax. (102.4) 


When the commutator {H, b(t, r)}- is calculated from (102.2) and the deita func- 
tion is eliminated by integration over d°x, we get 


(yp —eyA — m)W(t, r) = 0. (102.5) 


As we should expect, the operator #(t, r) satisfies an equation which is formally the 
same as Dirac’s equation. 

The equation for the electromagnetic field operator A(t,r) is obvious from the 
correlation with the classical case. When that case applies, i.e. when the occupation 
numbers are large (cf. §5), the operator equation must become the classical 
Maxwell’s equation for the potentials, Fields (30.2), after averaging over the state 
of the field. It is therefore clear that the equation for the operator is simply the 
same as Maxwell’s equation, so that we have (for an arbitrary gauge) 


a”a,A"(x) — a" d,A°(x) = —47ej"(x), (102.6) 
where j’(x) = W(x) y’ w(x) is the current operator, satisfying identically the equa- 
tion of continuity 

a,1°(x) =0. (102.7) 
It is important to note that the equations (102.6) are linear in A“ and j”, and the 
question of the sequence of these operators therefore does not arise. 
Like the similar equations for wave functions, the operator equations (102.6) 
and (102.7) are invariant under the gauge transformation 
A, (x) > Ay (x) — 4,8(x), 
px) baye™, (102.8) 
Ware po, | 
where y(x) is any Hermitian operator which commutes (at a particular time) with 
wt 


+ This refers specifically to the Heisenberg w-operators. In the interaction representation, the gauge 
transformation of the electromagnetic potentials does not affect the y-operators. 
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Let us now ascertain the relationship between the operators in the Heisenberg 
representation and those in the interaction representation. To simplify the dis- 
cussion, it is convenient to make the formal assumption (which will not affect the 
final result) that the interaction V(t) is adiabatically “switched on” from t = —@ to 
finite times. Then the Heisenberg and interaction representations are the same for 
t > —oo, and the wave functions of the system, ® and ®,,, are the same: 


Pin(t = —%) = @. (102.9) 


But the wave function in the Heisenberg representation is independent of time 
(since the whole of the time dependence is in the operators); in the interaction 
representation, the time dependence of the wave function is given by (72.7): 


Pin (t) = S(t, —%)Pin(—), (102.10) 


where 


ty 
(tr, i) = Texp|—i i V(t’) ar'|, (102.11) 
t 


1 


and the following properties of S are obvious: 


S(t, t)S(t, to) = SU, to), | (102.12) 
S(t, th) = S(t, t). 


Comparison of (102.10) and (102.9) gives 
O,(t) = S(t, —©)® (102.13) 


as the relation between the wave functions in the two representations. The operator 
transformation formula is similarly 


b(t, r) = S(t, —) pint, r)S(t, —©) 


(102. 14) 
a S(-%, t) in(t, r)S(t, =65), 


and likewise for w and A. 

One further general remark may be added. It has already been mentioned more 
than once that, in relativistic quantum theory, the physical significance of the field 
operators is very limited because the zero-point fluctuations are infinite. This is 
even more true of operators in the Heisenberg representation, which contain also 
divergences due to the interaction. In this chapter, §$102-109 deal with the formal 
theory, which ignores the question of eliminating these singularities and which 
treats all quantities as if they were finite. The results thus obtained have mainly 
heuristic value: they lead to a fuller understanding of the significance of the 
expansions given by perturbation theory, and they may also remain valid in some 
form in a future theory which is free from the present difficulties. 
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§ 103. The exact photon propagator 


The concepts of exact propagators play a central role in the formalism of the 
exact theory (i.e. without expansion in powers of e’).t 
The exact photon propagator (denoted by the script letter @) is defined by 


Duy(x — x') = iO|TA, (x) AL(x’)0), (103.1) 
where A, (x) are Heisenberg operators, in contrast to the definition (76.1): 
Dyy(x — x') = iO[TAR(x) AT(x')|0), (103.2) 


in which the operators in the interaction representation were used. The function 
(103.2) may be called the free (or bare)-photon propagator to distinguish it from 
the exact propagator (103.1). 

Since the mean value in (103.1) cannot be exactly calculated, it is impossible to 
obtain an exact analytical expression for Y,,, although the definition does lead to 
some general properties of this function, as will be discussed in $111; here we shall 
consider the calculation of %,, by perturbation theory, using the diagram tech- 
nique. For this purpose, we must express 9%,, in terms of the operators in the 
interaction representation. 
in the interaction representation. 

First, let t > t’. Using the relationship between A(x) and Ain(x) (cf. 102.14)), we 
can write 

Div(x — x') = iX0|A, (x) A,(x')]0) 
= i(0|S(—™, t)Ai(x)S(t, -~)S(—%, t’) x 


x A(x) S(t', —~)|0). 
According to (102.12) we can make the substitutions 
S(t, -20)S(—e, t') = S(t, 1’), 
$(—~, t) = $(—2, +2) §(, t). 
Then 
Dy lx — x') = i(0|S "1S, thaw) S(t, VAP’) S(t’, -%)]]0), (103.3) 
with 
S = S(+m, —0), (103.4) 
Since, according to the definition (102.11), S(t2, t1) includes only operators for times 


+ These concepts were introduced by F. J. Dyson (1949), who also developed essentially the whole 
of the treatment given in this chapter. 
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between t, and t, arranged in chronological sequence, it is evident that all the 
operator factors in the brackets in (103.3) are in order of decreasing time from left 
to right. If the time-ordering symbol T is placed before the bracket, we can 
rearrange the factors in any manner, since the operator T will automatically put 
them in the necessary order. Then we write the bracket as 


[-- -] = TIA (x)AM(x)S(~, t)S(t, t)S(t’, -)] 
= TLA™(x)A™(x')S]. 


Thus 
Dil(x—x')= i(O|S7'TLA R(x) Am (x')S]|0). (103.5) 


It is easily shown by a similar argument that this formula is also valid if t < t’. 

We shall now prove that the factor S~' can be taken outside the averaging over 
the vacuum to form a phase factor. To do so, we recall that the Heisenberg vacuum 
wave function © is the same as the value ®,,,(—~) of the wave function of the same 
state in the interaction representation (see (102.9)). From (72.8), 


S®;,(—) -_ S(+o, —©)Din(—%) = Din(+). 


The vacuum is a Strictly stationary state, in which no spontaneous processes of 
particle generation can occur. In other words, in the course of time the vacuum 
remains the vacuum; this means that ®,,,(+) can differ from ®;,(—%) only by a 
phase factor e'*. Hence 


S$ Din(—) = e'* Din(—%) = (0|S0)Pin(—©), (103.6) 
or, taking the complex conjugate and using the unitarity of the operator S, 
D¥(—%)$~! = (0|S|0) 'OK(—~). 
Hence it is clear that (103.5) can be written 


ie=i\=a TAgi(x)Ay'(x) 5/0) (103.7) 


(0[S|0) 


Substituting in the numerator and the denominator the expansion (72.10) for § 
and averaging by means of Wick’s theorem ($77), we get an expansion of Q&,, in 
powers of e’. 

In the numerator of (103.7), the quantities to be averaged differ from the matrix 
elements of the type (77.1) only in that the ‘external’? photon creation and 
annihilation operators are replaced by A™(x) and A(x’). Since all the factors in the 
products to be averaged are preceded by the time-ordering symbol, the pairwise 
contractions of these operators with the “internal” operators A™(x,), A(x), ... Will 
give the photon propagators D,,. Thus the results of the averaging are expressed by 
sets of diagrams with two free ends, constructed in accordance with the rules in 877, 
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except that propagators D,,, not the amplitudes e of real photons, correspond to 
external (and internal) photon lines. In the zero-order approximation, with S = 1, the 
numerator of (103.7) is simply D,,(x —x’). The next non-zero terms will be 
of the order of e”. They are represented by a set of diagrams having two free ends and 
two vertices: 


-~-<T>--- + (103.8) 


a 


The second of these diagrams consists of two disconnected parts: a broken line 
(corresponding to —iD,,) and a closed loop. The separation of the parts of the 
diagram signifies that the corresponding analytical expression separates into two 
independent factors. On adding to the diagrams (103.8) the zero-order ap- 
proximation diagram (a single broken line) and “taking it outside the brackets’’, we 
find that the numerator in (103.7) is, as far as second-order terms, 


---| - <=>}. cs, oe 


The expression (0|S|0) in the denominator of (103.7) is the amplitude of the 
“transition” from the vacuum to the vacuum. Its expansion therefore contains only 
diagrams without free ends. In the zero-order approximation, (0|S|0) = 1, and as far 
as second-order terms we have 


tr 


When the numerator is divided by the denominator we get, to the same order, the 


expression 
--- 5 ---<D>--- 


Thus the diagram with the detached loop does not occur in the result. This is a 
general theorem. Having regard to the way in which the diagrams are constructed 
which correspond to the numerator and denominator in (103.7), we can easily see 
that the role of the denominator (0|S|0) is simply to ensure that in all orders of 
perturbation theory the exact propagator %,, will be represented only by diagrams 
which do not contain separated parts. 

The diagrams with no free ends, forming closed loops, have no physical 
significance and need not be taken into account, quite apart from the fact that they 
disappear when the propagator @ is formed. Such loops represent radiative 
corrections to the diagonal element of the S-matrix for a vacuum—vacuum tran- 
sition; but, according to (103.6), the sum of all these loops, together with the unity 


QE4-EE 
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given by the zero-order approximation, gives only an unimportant phase factor, 
which cannot affect any physical results. 

The change from the coordinate representation to the momentum representation 
is made in the usual way. For example, in the second-order approximation of 
perturbation theory, the propagator —i%,,(k), which will be shown by a thick 
broken line, is the sum 


t A k + <> k ne) 


in which all the diagrams are calculated by the general rules given in 877 except 
that factors —iD,,(k) are assigned to the external as well as the internal photon lines. 
In analytical form, we therefore havet 


Diy (k) = Dy, (k) + ie’D,(k) i try‘ G(p + k)y’G(p) £?, D,.(k); (103.10) 


the bispinor indices of the matrices y and G are, as usual, omitted. 

The terms in subsequent approximations are constructed in a similar manner, 
and are represented by sets of diagrams having two external photon lines and the 
appropriate number of vertices. For example, the terms in e* correspond to the 
following four-vertex diagrams: 


HO HO D> 


The diagram 


also has four vertices; its upper part is a loop formed by a single “‘self-closed”’ 
electron line. Such a loop corresponds to the contraction W(x)-yws(x), i-e. to the value 
of the current averaged over the vacuum: (O0|j(x)|0). But, by the definition of the 
vacuum, this quantity must be zero identically, and the identity cannot of course be 
altered by any further radiative corrections to such a loop. Thus no diagrams 
having ‘‘self-closed”’ electron lines need be considered in any approximation. 


+ The factor —1 from the closed electron loop must be taken into account when deriving the signs. 
+ Although a direct calculation from the diagrams would lead to divergent integrals. 
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The part of a diagram which lies between two (external or internal) photon lines 
is called a photon self-energy part. In the general case, it can itself be divided into 
parts joined in pairs by a single photon line, i.e. it has a structure of the form 


Ov Om... <O 
where the circles denote parts which cannot be further subdivided in the same 
manner; such parts are said to be compact or proper. For example, the first three 
of the four fourth-order self-energy parts (103.11) are compact. 
Let iF,,/4m denote the sum of the infinity of compact self-energy parts. The 
function #,,,(k) is called the polarization operator. When the diagrams are classified 


by the number of compact parts which they contain, the exact propagator Y,, can 
be put in the form of a series 


Beet Se --/)--- re 
+ -@-@-- +... 


where iP,,,/4 corresponds to each shaded circle. The analytical form of this series is 


een = eo 8-. @ 
9=D+D7Z—~D+Dz_D 4, Pt 
od P De | 
= D{1 +5 |D+ Di D+ |} (103.12) 


where the indices are omitted, for brevity. The series in the brackets is again Q&. 
Hence 


P” (k) 


Duv(k) om D,,(k) bl D,(k) Aq Dov(k). (103.13) 


Multiplying this equation on the left by the inverse tensor (D~')™ and on the right 
by (@7')”’, and renaming the indices, we get the equivalent form 


Ad Pegs (103.14) 


It must be emphasized that writing @ in the form (103.12) assumes that the 
diagrams can be broken down into simpler parts calculated by the general rules of 
the diagram technique, and that the combination of such parts gives the correct 
expressions for the entire diagrams. The admissibility of this breakdown of the 
diagrams is an important and by no means trivial feature of the diagram technique, 
which arises from the fact that the overall numerical factor in the diagram does not 
depend on the order of the diagram. 
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The same property enables us to use the function @ (assumed known) to 
simplify the calculations of the radiative corrections to the amplitudes of various 
scattering processes: instead of treating afresh each time the diagrams with 
different corrections to the internal photon lines, we can simply make these lines 
thick, i.e. assign to them the propagators @ (instead of D) in the appropriate 
approximation. 

If the photon line corresponds to a real and not a virtual photon, i.e. if it is a free 
end of the whole diagram, the application to it of all the self-energy corrections 
gives what is called an effective external line. It corresponds to the expression 
obtained from (103.13) by replacing the factor D by the polarization amplitude of the 
real photon: 


P*(k) 


p 
e). 
4g 2 


e, + D,(k) (103.15) 


For an external-field line, e, in this expression is to be replaced by A‘. 

The discussion in §76 of the tensor structure and the gauge non-uniqueness of 
the approximate propagator D,, applies to the exact function 9,, also. Considering 
only the relativistically invariant representations of this function, we can write it in 
the general form 


Baul) = DER) ( Bun — SES) + DC) A (103.16) 


the first term corresponds to the Landau gauge, and in the second term @" is a 
gauge-arbitrary function. The corresponding form of the approximate propagatorT is 


D,,(k) = D(k?) (5. 2 “ese + DK) Ae (103.17) 


The longitudinal part @” of the propagator is related to the longitudinal part of 
the potential 4-vector, which has no physical significance. It is therefore not con- 
cerned in the interaction and is unaffected by the latter, so that 


B(k?) = Dk’). (103.18) 
The inverse tensors must, by definition, satisfy the equations 
DD” = 8, DD” = 63. 


When the original tensors have the form (103.16) or (103.17), the inverse tensors 
are, from (103.18), 


i -1( ae 1_k,k, 
Bowe = Gy \Bue 4) + DOE 
1 kuk 1_ kk pends 
zl eet nel = v v 
Dw = 55 (Boe EE) + en 


+ In this formula D® is not the same as in (76.3). 
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From these, it follows that the polarization operator Y,, is a transverse tensor: 


P,, = P(k?) (2 - 4), (103.20) 

where P = k?—4n/Q, or 
D(k?) = an (103.21 
=F PII] 21) 


Thus the polarization operator, unlike the photon propagator itself, is a gauge- 
invariant quantity. 


§ 104. The self-energy function of the photon 


In order to examine further the analytical properties of the photon propagator, 
it is useful to define, as well as the polarization operator, another auxiliary function 
Il,,(k), called the self-energy function of the photon: ill,,/47 is defined as the sum 
of all self-energy photon parts (not only the compact ones). If this sum is 
represented in the diagram by a square, we can write the exact propagator as the 
sum 


Ee 


Me Ag 


D, =D, +D Dyn. (104.1) 


Hence, expressing II,, as 


1 


rs Tl, = D',9”D' 5 — Dw 


and substituting (103.16) and (103.19) followed by (103.21), we get 


kk, g 
I, = Tk) (gu = a), =e (104.2) 


Thus II,,, like Y,.,, is a gauge-invariant tensor. 
The usefulness of II,, arises from the expression for it in the coordinate 
representation. This is easily found by noting that the equation 


1 7 zi 
Za Uyw(k) = D'y.D“"{B"(k) — D¥ (KD, 
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in which the tensor #*°— D” is transverse by (103.18), can be written in the 
coordinate representation 


1 C , / I IO , U 
Tuv(X = x) =F (8u 9a — Bur9o9" (914 ~ 8 1p 09 {D(x — x')— D(x — x'J}. 
In order to carry out the differentiation, we must substitute 
Q(x — x’) — D(x — x’) = (O/T A*(x)A?(x’) — TAR(x)AR(x’)0). (104.3) 


In §75 we have seen that the differentiation of a T product generally demands 
caution, because the product has discontinuities. But the difference that is to be 
averaged in (104.3) is continuous, and so are its first derivatives, since the com- 
mutation rules are the same for the components of the operators A*(x) and Aii(x) 
for a given time, and the corresponding discontinuities cancel out (cf. $75). The 
difference in (104.3) may therefore be differentiated under the symbol T. According 
to (102.6) and the corresponding equation with zero on the right for the free 
electromagnetic field operators At.(x), the result is 


I1,,,(x — x") = 4arie2(O|Tj.,(x)j,(x)10). (104.4) 


This shows explicitly the gauge-invariance of II,,, since the current operators are 
gauge-invariant. 

From (104.4) we can derive an important integral form of this function. 
According to (104.2), it is sufficient to consider the scalar function IT = 3114. In the 
coordinate representation, 


II(x — x’) = “2 ie*(O|Tj,,(x)j* (x’)|0) 


> Olj,.)|nhin|j*(x)0) for t>t’, 
aT ie? |. (104.5) 
5 [2 Olina nXn|i"@o)o) for t<t, 


- 
3 


where n labels the states of the system electromagnetic field + electron—positron 
field.f Since the current operator i(x) depends on x" =(t,r), its matrix elements 
also depend on x. The relationship can be found explicitly by taking as the states 
|n) states which have definite values of the total 4-momentum. 

The time dependence of the current matrix elements, like that of any Heisen- 
berg operator, is given by 


(n|j“(t, r)|m) = (n]j*(r)|m)e Fn Ft 


where E, and E,, are the energies of the states |n) and |m), and i(r) is the 
Schrédinger operator. 


+ The current operator conserves charge; hence the states |n) in (104.5) can contain only the same 
numbers of electrons and positrons. 
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To determine the coordinate dependence of the matrix elements, we consider 
the operator j(r) as being the result of transforming the operator j(0) by a parallel 
translation over the distance r. The operator of this translation is exp(ir - P), where 
P is the total momentum operator of the system (see QM, (15.15)). Using the 
general rule for the transformation of the matrix elements (see QM, (12.7)), we 
therefore have 

(n|j"@|m) = (nje"" Pj" (Oe ?|m) 


= (n|j*(O)|m)e'Pn Pr) -F 
Together with the previous formula, this gives finally 
(n|j*(t, r)|m) = (n|j*(0)|m)ye Pm Pos (104.6) 


The matrix (n|j"(0)|m) is Hermitian, like the matrix (104.6) of the entire operator 
]"(t,r), and according to the equation of continuity (102.7) it satisfies the trans- 
versality condition 


(Py — Pn)"{n|j,,(0)|m) = 0. (104.7) 
Let us now calculate the function I(x — x’). Substitution of (104.6) in (104.5) 
gives 


- 2 
mg ="3 SY (Olj,.O)|n)nlj"(O|Oye*§ for 7 ZO, (104.8) 


where x — x’ = € = (1, &). We use the notation 


2 
p(k?) =~ 75 20) 5 Oli, Olny(0|7")|ny*5Ck — P,), (104.9) 


The sum ts taken over all systems of real electron—positron pairs and photons that 
can be generated by a virtual photon having 4-momentum k = (w, k) (w > 0), and for 
each such system there is summation over the internal variables (the polarizations 
and momenta of the particles in the centre-of-mass system).{ After this summation, 
the function p can depend only on k, and since it is a scalar it can depend only on 
k?. In particular, it does not depend on the direction of k. Using these properties of 
p, we can rewrite (104.8) as 


IN(é) = —1 | dw | Gapotkem een 
0 
- J aay | deo d(u)8(w? — k*)p(u)e™ & Wl, 
0 0 


+ This definition of the states |n) is evidently identical with their definition as states for which the 
matrix elements (O|jjn) of a charge-odd operator are non-zero. 
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The momentum representation is obtained by substituting 


co 


—iw|r ° —~ikyt 1 dk 

e ll = iw | e iKo oe (104.10) 
(see §76); the result is 

2) _ 2 2 2 Pe: a p(w’) 
mk) = { d(u) [ d(o3(u? + ¥? - 0) pr, 

0 0 

or, finally, 
» — plu?) du? 
ek) = | >"> (104.11) 


k* — p?+i0° 
0 


The coefficient p in this integral form is called the spectral density of the 
function II(k’), and has the properties 


o(k’?)=0 for k?<0, 
(104.12) 


o(k?)>0 for k’*>0, 
since the 4-momentum k of a virtual photon which can generate a system of real 
particles must necessarily be time-like; k* is equal to the square of the total energy 


of the particles in their centre-of-mass system. The transversality condition (104.7) 
gives 


P*{0|j,,(0)|n) = 0. 
The 4-vector (Q|j|n) is orthogonal to the time-like 4-vector P, and must be 
space-like: 
(O|j,(0)|n XO|j" (0)|[n)* <0; 


thus, from the definition (104.9), p >0. 


§105. The exact electron propagator 
The exact electron propagator, similarly to that of the photon, is defined by 


G(x — x') = —i(O|T yi (x) yi (x')]0) (105.1) 


+ The formal calculations analogous to those given above require caution, on account of the 
presence of the divergences previously mentioned. These give rise, in particular, to thé occurrence on the 
right of (104.11) of further divergent terms which do not have an explicitly relativistically invariant form, 
called Schwinger terms. They will not be written out here, since they in any case disappear on 
renormalization (§110) and do not affect the subsequent results. 
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where i and k are bispinor indices, which differs from the definition (75.1) of the 
free-particle propagator 


Gi (x — x’) = —i(O| Twix) b2(x')/0) (105.2) 


in that the w-operators in the interaction representation are replaced by Heisenberg 
operators. 
The same arguments as were used to derive (103.7) lead to 


Gi (X = x’) ee (0 Ty (x) pe'(x')S 0) (105.3) 


(0[S|0) 


The expansion of this expression in powers of e” puts the $ function in the form of 
a set of diagrams with two external electron lines and various numbers of 
vertices. The denominator in (105.3) again has the function of retaining only the 
diagrams which do not have detached “vacuum loops’’. For example, as far as the 
terms in e*, the graphical representation of the propagator G (denoted by a thick 
continuous line) ist 


+ —<>- + 105.4 
2 a _ Yo. (105.4) 
NL U7 ee, 


The thick continuous line corresponds to the function i9(p) in the momentum 
representation, and the sets of continuous and broken lines in the diagrams on the 
right of the equation correspond to the free-particle propagators iG and —iD 
respectively. 

The section between two electron lines is called an electron self-energy part. As 
with the photon, it is said to be compact if it cannot be further subdivided into two 
self-energy parts by cutting a single electron line. The sum of all possible compact 
parts will be denoted by —iM,; the function M(p) is called the mass operator. For 
example, as far as the terms in e*, 


C4) a AD. (105.5) 


a jor re y, 
~ 


+ It has already been shown in §103 that there is also no need to take account of diagrams which 
contain “self-closed”’ lines; these would here appear in the second order: 


V 


| 
ee ees Cann ee 


470 Exact Propagators and Vertex Parts $105 


By a summation exactly similar to the derivation of (103.13), we find 


G(p) = G(p) + G(p)M(p)G(p) (105.6) 


(omitting the bispinor indices) or, for the inverse matrices, 


G\(p)= G"'(p)— M(p) 
= yp —-m—M(p). (105.7) 


It has already been noted in $102 that the Heisenberg w-operators (unlike those 
in the interaction representation) are altered by a gauge transformation of the 
electromagnetic potentials. The exact electron propagator @ is therefore also not 
gauge-invariant. Its gauge transformation behaviour may be derived as follows (L. 
D. Landau and I. M. Khalatnikov, 1952). 

The change in 4 under the gauge transformation must evidently be expressed in 
terms of the same quantity D as is added to the photon propagator by this 
transformation. This is clear, since in the calculation of @ by the perturbation- 
theory diagrams each term of the series is expressed in terms of the functions D, 
and no other electromagnetic quantities are involved. The analysis can therefore be 
simplified: any special assumptions can be made regarding the properties of the 
arbitrary operator x in the transfromation (102.8), provided that the result is 
expressed in terms of D”. 

The transformation (102.8) brings the propagators @ (103.1) and @ (105.1) into 
the following forms: 


Duy > WO|TLA, (x) — 4.x (x) AL’) — 36x10), 


Gx. > — (OT oxo Me" (x)]0). 


(105.8) 


We shall now suppose that the operators y are averaged independently of all 
the remaining operators in the T product. This is a reasonable assumption, since 
the “‘field’’ ¥ takes no part in the interaction, because of the gauge invariance. We 
also assume that the mean value, over the vacuum, of the operator y is Zero: 
(0|x|0) = 0. Then the terms in x in (105.8) can be separated, and the result is 


Duy > Duy + HOIT A, x(x) - arx(x’)]0), (105.9) 
Gi. > Gy (O/T ee X10), (105.10) 


The rest of the derivation will be given for the case of an infinitesimal 
transformation, and we shall emphasize this by writing dy in place of x. 
The transformation (105.9) may be writtent (independently of the smallness of 


+ Formula (105.11) can be derived from (105.9) if the function d and its derivative with respect to t 
are continuous at t =t'; if they are discontinuous, the right-hand sides of these expressions differ by 
delta-function terms (cf. the derivation of (75.2)). In the momentum representation, this condition is 
equivalent to assuming that d‘°(q) decreases more rapidly than 1/q’ as |q’|> =. 
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5x) as 
Duy >Duv + 6Dyv, SDy, = 0,0)d(x — x’), (105.11) 
where 
d(x — x’) = i(0|TSx(x)5x(x')|0). (105.12) 


Hence it is clear that d“” determines the change caused by the gauge transformation 
in the longitudinal part Z” of the photon propagator. The assumption that d 
depends only on x — x’ implies, of course, a certain limitation on the properties of 
the operator 6x; in the general case of a completely arbitrary gauge transformation, 
the propagator may cease to be homogeneous in space and time. 

In the transformation (105.10), we expand the exponential factors in powers of 
dy as far as the quadratic terms: 


(0|Te iedx(x) 9 ONO) 
~ —3e(0|5x7(x) + x(x’) — 2T 8x (x)5x(x')[0). 


Using the definition (105.12), we thus find the following transformation rule for the 
electron propagator: 


G>G+ 5G, 8G = ie® G(x — x')[dO) — d(x — x')]. (105.13) 


In the momentum representation,t we have 
2 (1) d*q ‘ 
5G(p) = ie” | d'(qi[G(p) — G(p — q)I On) (105.14) 


d(q) is related to the change in the function 2” by 


5D (q) = q?d(q). (105.15) 


+ If the function f(x) = f\(x)f2(x), its Fourier components are 
fp) = | sone a's 


fff aie t ar 4 en CDEC 


=| d*qi d*qo 
(| SedneGe 


(271) 5(p — qi— aa)f (an f2(q2) 


_ { d'a 
=| Shane q). 
In deriving (105.14) from (105.13), we also use the result 


4 
f(x =0) = ( ae 
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An integral expression analogous to (104.11) could be derived for the electron 
propagator, using the expressions 


Wm (X) = Wnm (O)e Pm Pas (105.16) 


for the matrix elements of the p-operator, similarly to the expressions (104.6) for 
the current matrix elements. Unlike the current, however, the w-operators are not 
gauge-invariant. The coordinate dependence (105.16) is therefore not general, but 
applies only to some particular gauge. The same is true as regards the integral 
representation based on (105.16). The deeper physical reason for this situation is 
that the zero photon mass leads to the infra-red catastrophe ($98). In consequence, 
the electron emits an infinite number of soft quanta during the interaction, and this 
means that the “single-particle” propagator (105.1) loses much of its direct 
significance. 


§ 106. Vertex parts 


In complicated diagrams it is possible to distinguish both self-energy parts and 
sections of another type which are not equivalent to them. An important class of 
such sections is found by considering the function 


K 4(x1, X2, X3) = (O[TA" (x1) i(x2) hi. (x3)|0) (106.1) 
which has one 4-vector index and two bispinor indices; since space-time is 
homogeneous, this function depends only on the differences of the arguments x,, 


X2, X3. When expressed in terms of the operators in the interaction representation, 
the function K has the form 


Regi trent = (OTA fie xi) Wi"(x2) hie'(x3)S|0) (106.2) 


(0|S|0) 
The momentum representation is obtained by using the formula 
(2n)'5(p1 + k ~ pa)KR(P2, prs k) = [ [| KkGu, x2, xs)e Hr dx, doe ds, 
(106.3) 


In the diagram technique, the functions K& correspond to three-ended (one 
photon and two electron) sections of the form 


1 
k 


(106.4) 


P2 Pi 
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where the momenta are related by the conservation law 
Pit k =p» (106.5) 


The zero-order term in the expansion of this function is zero; the first-order 
term is 


K* (x1, X2, X3) =e | G(x2— x)y,G(x — x3) - D(x, — x) d*x 


in the coordinate representation, and 
K* (D2, pi; k) = eG(p2)y,G(p,) - D™ (k) (106.6) 


in the momentum representation (omitting the bispinor indices); the corresponding 


diagram is 
MN (106.7) 


In the subsequent approximations, the diagrams are complicated by the addition 
of new vertices, but not all such diagrams provide essentially new information. For 
instance, in the third order we have the diagrams 


© | | ! on 
KR WN LSS 


The first three of these can be cut (across one photon or electron line) into a simple 
vertex (106.7) and a second-order self-energy part; the fourth diagram cannot be 
thus treated. This is a general situation. The corrections of the first kind simply 
replace the factors G and D in (106.6) by the exact propagators § and &%. The 
remaining terms in the expansion give a new quantity to replace the factor y” in 
(106.6). Denoting this quantity by I’, we thus have by definition 


K*" (po, pis k) = {iG(p2)[—iel, (po, pis kK) JiG(pyH- iD” (kK). (106.9) 


A section joined to other parts of the diagram by one photon line and two 
electron lines is called a vertex part if it cannot be divided into parts joined by only 
one (electron or photon) line. The quantity I’ is the sum of an infinity of vertex 
parts, including the simple vertex y”, and is called a vertex operator or vertex 
function. 
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The following are all the vertex-operator diagrams as far as the fifth-order 


quantities: 
! | I | | 
| | | | 
7 ~ACTATATA' 
P) b i d 
\ | i | | 
| | I I | 
+ + e 2 + oe 
ey ian aA _— — Oo 
e f g h i 


the black dot denoting the exact vertex operator —ie!. 

The operator I (like the operator y of the simple vertex) has two matrix 
(bispinor) indices and one 4-vector index; it is a function of two electron momenta 
(Pi, P2) and one photon momentum (k). The three momenta cannot all relate to real 
particles simultaneously: the diagram (106.4) in itself (not as part of a larger 
diagram) would correspond to the absorption of a photon by a free electron, but 
this process is incompatible with the conservation of the 4-momentum of real 
particles. Hence at least one of the three free ends of the diagram must pertain to a 
virtual particle (or to an external field). 

The vertex parts may also be classified as reducible and irreducible. The 
irreducible ones are those which do not contain self-energy corrections to internal 
lines and in which it is not possible to separate parts which constitute (lower-order) 
corrections to internal vertices. For example, of the diagrams in (106.10), the only 
irreducible ones are (b) and (d) (apart from the simple vertex (a)). Diagrams (g), (h) 
and (i) contain self-energy parts; in diagram (c) the upper broken horizontal line 
may be regarded as a correction to the upper vertex, and in diagrams (e) and (f) the 
lateral broken lines may be regarded as corrections to the lateral vertices. 

When the internal lines in irreducible diagrams are replaced by corresponding 
thick lines, and the vertices by black dots, i.e. when the approximate propagators D 
and G are replaced by the exact propagators &@ and &, and the approximate vertex 
operators y by the exact ones I’,f we evidently obtain the set of all vertex parts. 
Thus the expansion of the vertex operator may be written 


; | 
Act At A? (106.11) 


This equation is an integral equation for I’, with an infinity of terms on the right. 
From the above discussion we can easily derive the general principle of 
construction of the exact expressions for sections having any number of ends. 


(106.10) 


+ The resulting diagrams are called skeleton diagrams. 
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They are obtained as vacuum mean values of T products of Heisenberg operators, 
with one operator s(x) for each initial electron, one (x) for each final electron, 
and one A(x) for each photon. 

A further example is given by diagrams of the form 


P3 \ P| 


(106.12) 
Ps 'P2 
with four external electron lines. These are obtained from the function 
Kix, im(X1, X25 X3, X4) = (O|T yi (xi (x2) bi (X3) im (x4)|0), (106.13) 


which, of course, depends only on the differences of the four arguments. Its 
Fourier components may be written 


| K ik, tm (X1, X23 X3, Xae PH tParr Prs-P2%d) d4y d4x5 d*x3 d*x4 


= (27)8(p, + P2— p3— ps) Kix, m(D3, P43 Pi, P2)s (106.14) 
with 


Kix, im(D3, P43 Pi, P2) 
= (27r)*5 (py — p3)Gi(P 1) Gin (D2) — (27)*8(p, — P3)Gim (DP 1) Ga (Pp2) + 
+ Gin(D3) Ger(pa)l— iP nr, st(P3, P43 Pi, P2))Gst(P1) Gim (D2). (106.15) 


In the latter expression, the first two terms exclude from the definition of the 
function I(p3, ps; Pi, P2) diagrams which fall into two disconnected parts, each 
having two free ends: 


r—( a ~—{ 
In the third term the @ factors exclude from the definition of I’ those parts of 
diagrams which are corrections to external electron lines. 


From the properties of the T product of the Fermi w-operators, it follows that 
the functions I(p3, p43 pi, p2) are antisymmetric: 


Vik, m(D3, P43 Py P2) = —V i, m(Pas P33 Pi, P2) 
= —D ix mi(P3, P43 P25 P1). (106.16) 


If the momenta pi, p2, p3, ps correspond to real particles, the non-separating (i.e. 
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connected) diagrams (106.12) represent the scattering of two electrons. The scat- 
tering amplitude is found by assigning the wave amplitudes of the particles (instead 
of the propagators 4) to the free ends of the diagram:t 


iMgi = Ui(p3) tk (ps)[—iel ix, m(P3, P43 Pi, P2)]Ui:(D1) Um (P2). (106.17) 


According to (106.16) this amplitude must have the appropriate antisymmetry with 
respect to interchanges of electrons. 


§ 107. Dyson’s equations 


The exact propagators and the vertex part satisfy certain integral relations, the 
origin of which is particularly clear if the diagram technique is used. 

The concepts of reducibility and irreducibility defined in §106 can be applied not 
only to vertex parts but also to any other diagrams or parts thereof. Let us consider 
from this aspect the compact self-energy electron diagrams. 

It is easily seen that only one diagram out of this infinity is irreducible, namely 
the second-order diagram 


Any complication of this diagram can be regarded as the application of further 
corrections to its internal (electron or photon) lines or to one of its vertices. Here it 
is important to note that, owing to the obvious symmetry of the diagram, any 
vertex correction need by assigned only to either one or the other vertex.t 

Since, therefore, only one of the compact self-energy electron parts is irreduci- 
ble, the ensemble of all such parts (i.e. the mass operator 4) is represented by only 
one skeleton diagram: 


pt+k 
—Ci“)— ~ <> (107.1) 
k 


+ It will be seen later ($110) that the self-energy parts in the free ends can be ignored in deriving the 
amplitudes of real processes. 

+ For clarity, it should be emphasized that, although all the required diagrams are found by applying 
corrections to only one vertex, for any particular diagram the structure of the correction section in 
general depends on the vertex to which it is assigned, for example 


Ret | 
where, in identical diagrams, the squares enclose sections which form the vertex part when it is assigned 
to the right-hand and left-hand vertices respectively. 
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In analytical form, this graphical equation becomest 
M(p) = G"(p)— G"(p) 
ea ene v iv ‘ d‘k 
=—le’ | y"G(pt+k)l"*(p +k, p; k)- D,,(k) On) (107.2) 
A similar expression can be derived for the polarization operator Y. Again only 


one of the compact self-energy photon parts is irreducible, and # is therefore 
represented by a single skeleton diagram: 


p+k 
ee i <= >= (107.3) 
4x k k 
p 
The corresponding analytical equation is 

P Ak - 7 
Sw) = D1,(k)— Dk) 

a5 es d‘p 

=ie°tr | y,G(p + kK) (p + k, p; k)G(p) On” (107.4) 


the bispinor indices are omitted from (107.2) and (107.4). 

The relations (107.2) and (107.4) are called Dyson’s equations; they can also be 
obtained by direct calculation. For example, to derive (107.2) we consider the 
quantity 


(yp — m)iGu(x — x!) = —i(yp — m)n(O|T d(x) yi (xJ]0), 


where p = id is the operator of differentiation with respect to x, which is found 
from (102.5) in exactly the same way as was done when deriving (75.7) for the 
free-particle propagator. The result is 
(yp — M)iGu(x — x’) 
= —ieyi(O|TA,(x)Yi(x)bi(x')]0) + 85 O(x — x’); 


the delta-function term on the right is the same as in (75.7), since the commutation 
properties at t = t’ are the same for W-operators in the Heisenberg and interaction 
representations. The first term is —iey,Ki.(x, x, x'), and we can thus write (again 
omitting the bispinor indices) 


(yp — m)G(x — x’) = —iey"K,,(x, x, x’) + 6 (x — x’). (107.5) 
To obtain the Fourier components we note that, if the definition (106.3) is 


+ If the exact vertex part is assigned to the left-hand vertex in (107.1), the factors y and IV are 
interchanged in (107.2). The two forms of the equation are, of course, essentially equivalent. 


QE4 - FF 
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integrated over d‘*kd‘p,/(27r)°, the result is 


[ K-@ + k, p; k) = | K (0, 0, x3)e 3 d*x3 


om a 
= i K*"(x, x, xe?" d(x — x’), (107.6) 
from which it is seen that the integral on the left is the Fourier component of 


K*"(x, x, x’). Thus, by taking the Fourier components of both sides of (107.5), and 
using the definition (106.9) and the formula yp — m = G~‘(p), we find 


G"'(p)G@(p) =1- ie? | "G(p + kK)" (p +k, p; kK) G(p)Dy(k) Ba ae 


Finally, multiplying this on the right by G'(p), we obtain (107.2). 


§ 108. Ward’s identity 


Another relationship between the photon propagator and the vertex part, 
simpler than Dyson’s equation, follows from gauge invariance. To derive, it, we 
apply the gauge transformation (102.8), assuming that y(x) = 6y(x) is an infinitesi- 
mal non-operator function of the 4-coordinates x. Then the change in the electron 
propagator is 


5G(x, x’) = ieG(x — x’) [by (x) — by (x’)]. (108.1) 


Note that this gauge transformation violates the homogeneity of space-time, and 
the function 54 depends on the arguments x and x’ separately, not only on the 
difference x — x’. Its Fourier expansion must therefore be made in the variables x 
and x’ separately. Thus, in the momentum representation 59 is a function of two 
4-momenta: 


5G(p>, p1) = | | 5G(x, xe? d4y d4x! 


Substituting (108.1) and integrating over d*x d*é or d*éd*x' (€=x-—x’), we get 
5G(p + q, p) = iedx(q)[G(p) — G(p + gq). (108.2) 


With the same gauge transformation, the operator A, (x) is augmented by the 
function 


SA®(x) = — (108.3) 


which may be regarded as an infinitesimal external field. In the momentum 
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representation, 
6A(q) = ig,5x(q). (108.4) 


The quantity 59 can also be calculated as the change in the propagator under the 
action of this field. As far as quantities of the first order in dy, this change can 
evidently be represented by a single skeleton diagram: 


}2 


P+q Pp 


The thick broken line is the effective external-field line, corresponding to the factor 
(see (103.15)) 


opv 
5A®(q) + 6AL(q) oo Di,(q). 


The 4-vector 5A‘(q) is longitudinal (with respect to q) and the tensor #’” is 
transverse. The second term is therefore zero, leaving 


{4 
108.5 
15G(p+q,p)= eel comes ( ) 


P+q P 


where the thin broken line corresponds, in the usual manner, to the field 5A 
simply. In the analytical form, 


6G = eG(p + g)I*(p +g, 3 4) G(p) - SA,”. (108.6) 
Substituting (108.4) and comparing with (108.2), we get 
G(p + q)— G(p) = —G(p + gy" (p + g, D3 Q)G(P) * ay 
or, in terms of the inverse matrices, 
G'(p + q)- "(p) = 4, T*(p +g, Ps q) (108.7) 
(H. S. Green, 1953). 
Taking the limit of this equation as q—>0 and equating coefficients when q, is 


infinitesimal, we get 


aut 


ap, G"(p) =T*(p, p; 0). (108.8) 


This is Ward’s identity (J. C. Ward, 1950). We see that the momentum derivative of 


480 Exact Propagators and Vertex Parts §108 


G”'(p) is equal to the vertex operator with zero momentum transfer.t The derivative 
of the function G(p) itself is 


—Zp7 tS) = 1S (P IE *(D, ps NBD). (108.9) 


The higher derivatives could be found similarly by continuing the calculations 
to higher orders in dy, but we shall not need these expressions. 

Let us now consider the derivative dA(k)/dk, of the polarization operator. 
Unlike §(p), A(k) is gauge-invariant and is unchanged by the application of the 
fictitious external field (108.4). Its derivative therefore cannot be calculated in the 
same way, but a diagram expression can be obtained for this derivative too. To do 
so, we consider the first diagram in the definition of #: the second-order diagram 


pt+k 
eee --= ---~<C>--- (108.10) 


The continuous lines correspond to the factors iG(p) and iG(p + k). Differentiation 
with respect to k replaces the second factor by idG(p + k)/ok, and according to the 
identity (108.9) this change is equivalent to adding a further vertex on the electron 


line: 
K=0 
aa Bg 2 (108.11) 
Pe ok Hk 


We see that, in the first non-vanishing order, the required derivative has been 
expressed in terms of a diagram having three photon ends. It must be stressed 
immediately that this diagram does not itself give the amplitude for the trans- 
formation of one photon into two. The amplitude of this process is the sum of 
(108.11) and a similar diagram in which the loop is traversed in the other direction, 
and the sum is zero by Furry’s theorem. The diagram (108.11) is not itself zero. 

In a similar manner, we can differentiate more complicated diagrams by 
successively adding vertices with k’ = 0 on all the electron lines which depend on k. 
There are, however, diagrams in which the dependence on k occurs in the internal 
photon lines also, for instance the diagram on the left in the next equation: 


i In the zero-order approximation, i.e. for the free-particle propagator, this identity is obvious: 
G‘(p) = yp — m, and therefore dG™'/dp, = y". 
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The derivative of the diagram in the braces is shown here in diagram form by 
means of a new graphical symbolism, a fictitious three-particle photon vertex, i.e. a 
point where three broken lines meet, corresponding to the quantity 


4ri —— = 2ik, = v,. (108.12) 


We can now differentiate any diagram by adding, to the lines depending on k, 
vertices v, or y, and continuing in accordance with the general rules. Summation of 
these higher-order corrections gives 


1 0P., 
= a= Vike, (108.13) 


where ieY,,, 1s the sum of the internal parts of all the diagrams with three photon 
ends thus obtained. 

We shall also need the second derivative of the polarization operator. Differen- 
tiating the equation (108.13) once more in a similar manner, we have 


oe ee (108.14) 
An dk’ak? 9 ~ Mery | Maer 
where ie’ is the sum of the internal parts of all the diagrams with four photon 
ends such as 


0 79 
\ / 
fe) Oo 
ay (108.15) 
ya NX 
/. Ny 


including, of course, those containing the fictitious three-particle vertices (108.12). 


$109. Electron propagators in an external field 


If a system is in a given external field A(x), the exact electron propagator is 
expressed by the same formula (105.1), but in the Hamiltonian H = Hy+ V which 
converts to the Heisenberg representation of operators we have also the interaction 
between the electrons and the external field: 


V= e| Aj d°x +e | Avi d°x. (109.1) 


Since the external field makes space and time no longer homogeneous, the 
propagator G(x, x’) will now depend on the two arguments x and x’ separately and 
not only on the difference x — x’. 


482 Exact Propagators and Vertex Parts § 109 


If we proceed in the usual manner to the interaction representation, the 
ordinary diagram technique is obtained, with external-field lines as well as virtual 
photon lines. This technique is, however, unsuitable when the external field cannot 
be regarded as a small perturbation, in particular when the particles may be in 
bound states in the field. Now the electron propagator in an external field is in fact 
required principally for the analysis of the properties of bound states, and in 
particular for determining the energy levels with allowance for radiative cor- 
rections. In order to derive such a propagator, we have to start from a represen- 
tation of operators where the external field is exactly taken into account even in the 
‘zero-order’ approximation with respect to the electron—photon interaction (W. H. 
Furry, 1951). 

We shall henceforward assume the external field to be independent of time. The 
desired representation of the w-operators is given by the formulae (32.9) for second 
quantization in an external field: 


b(t, r) = > {a,b wrye + bewOMe®*}, 


(109.2) 
W(t nr) = > {ar bP wre’ + baw wre 4, 


where Wr) and e{° are the wave functions and energy levels of the electron and 
the positron respectively, which are solutions of the “‘single-particle”’ problem, 1.e. 
of Dirac’s equation for a particle in a field. It is easily seen that the operators 
(109.2) are w-operators in a certain representation (the Furry representation) which 
iS, aS it were, intermediate between the Heisenberg and interaction representations. 
They may be written 


(109.3) 


A 
— 


p(t, r) = et i(rye it, | 
YOCt,r) = eM baye MM, 


where 
=e e | AM) iG) d°x. 


The electromagnetic-field operator A, of course commutes with the second term in 
H,, and so the Furry representation is the same as the interaction representation 
for this operator. 

The electron propagator in the zero-order approximation, in the new represen- 
tation, is defined as 


GiP(x, x') = —10|T HP) H(x)0). (109.4) 
The operator W(t, r) satisfies Dirac’s equation in the external field: 


[yp — eyA(x) — m]p(t, r) = 0, (109.5) 
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and the function G“ correspondingly satisfies the equation 
[yp — eyA(x) — m]G(x, x!) = 6(x — x’); (109.6) 


cf. the derivation of (107.5). 

The diagram technique, which expresses the exact propagator & as a Series in 
powers of e’, is obtained by changing from the Heisenberg to the Furry represen- 
tation, in exactly the same way as the earlier change to the interaction represen- 
tation. The resulting diagrams are of the same form, with the continuous lines now 
corresponding to factors iG instead of iG. 

One slight difference in the rules for writing the analytical expressions for the 
diagrams arises because in the coordinate representation G® is not a function of 
the difference x — x' only. In a constant external field, however, the homogeneity of 
time is preserved, and so the times t and t’ will again appear only as the difference 
bt 


GG” = G°(7,r,r’). 


The momentum representation is obtained by a Fourier expansion with respect to 
each of the arguments of the function: 


: icici tele de d*p, d° 
G°(7,r,r’) = [fe (P2F-PLk G(s. py, pj) ae aay Tay (109.7) 


Each line corresponding to the factor iG(e, po, p;) must now be assigned one value 
of the virtual energy « and two values of the momentum, the initial value p, and the 
final value p,: 


iG(e, p2, P:) ry Pp2 E pi. (109.8) 


This leads to the rule for writing the analytical expressions, in which the integration 
over de/27 is normal, but those over d’*p,/(27)° and d’p./(27r)’ are independent, the 
conservation of momentum at each vertex being taken into account. For example, 


e—G) 
p—k 
€ p" ie pso€ a 
P2 ed P, 
w,k 


=e {ff G(e, po, p")y"G(e — w, p"~k, p’—k) x 


d‘k d’p' d*p ” 
vei(e) ' 
x y’G'"(e, p’, pi) D,,.(@, kK) 5 oaF Qn) on (109.9) 


It is important to note that in this technique one must also take account of 
diagrams with ‘‘self-closed’’ electron lines, which in the ordinary technique are 
rejected as being associated with a “vacuum current’. When an external field is 
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present, this current need not be zero, because of the “‘vacuum polarization” 
caused by the field. For instance, in the diagram 


PY p+k (109.10) 


the loop at the top corresponds to the factor 


i| | G©(w, p +k, p) Aa. oe. (109.11) 


Here, however, we must still specify the meaning of the integral over dw. This is 
because the integration of the Fourier component of the function G(r) with 
respect to w amounts to taking the value of that function at 7 =0, and G(7) is 
discontinuous at t = 0; we must therefore indicate which of its two limiting values 
is to be taken. To resolve this question, we need only note that the integral (109.11) 
arises from the contraction of w-operators in the same current operator: 


J* = W(t, r)y"Pt, v), 


where is to the left of &. According to the definition of the propagator (109.4), 
this order of factors for t =t’ is obtained if t’ is taken as t + 0, 1.e. if the limiting 
value of the function G“(t — t’) as t —t'>—0 is taken. In other words, the integral 
over dw/27 in (109.11) is to be taken as 


[cee oe HBR. (109.12) 


The mass operator in the external field is defined as in §105: —iM is the sum of 
all the compact self-energy parts. It is now a function of the energy e and the 
momenta p, and p» at the ends of the external lines where they respectively enter 
and leave the part in question: 


(109.13) 


Proceeding exactly as in the derivation of (105.6), we get the equation 


Ge, p2; Pi) = Ge; p2, Pi) 


3 


= [[ GC, pe. DMC. P" PIMP PISESP. 109.14 
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This can be put in a more natural form by returning to the coordinate 


representation in terms of the spatial variables, using the function 


3 3 
erie | | G(e, pr, pietPr Pr") acre (109.15) 


and similarly for the other quantities. Taking the inverse Fourier transform of 
(109.14), we obtain 


G(e,r,r’) — G(e, r, r’) 
= | | GO(e, r, r2)M(e, ro, 11) G(e, r1, ¥") d?x, d?x. 


Next we apply to both sides the operator 
yey p—ey"Ai(x), 
where « is a number, and p = — iV is the operator of differentiation with respect to 
the coordinates r. Here it must be noted that, by (109.6), 
[y°s —y + p— eyA(x)]GCe, r, r’) = 6(r—r’). (109.16) 


The resulting equation is 


[y°s —y > p— eyA(x)] Ge, r, r’) — i M(e, r,11)G(e, m1, 1) d°x; = 6(r—-r’). 
(109.17) 


The function G(e,r,r’) has the especially valuable property that its poles 
determine the energy levels of the electron in the external field. We shall prove this 
first for the approximate function G(e, r,r’). Substituting the operators (109.2) in 
the definition of the propagator (109.4), we obtain, in exact analogy to formulae 
(75.12) for the free-particle propagator, 


—i > PP OPV(r) exp{-ieMH -— tM}, >’, 
GY(t-—t'=.r,r)= _ (109.18) 
iD WP MPR) explieM(t-tY, tt <’, 


and the Fourier time component is 


GPE, r, r') = 2 [ous thas ner) Wii ir) oO} (109.19) 


ae rary ad Se 6-70 


We see that G(e,r,r’), as an analytic function of ¢, has poles on the positive real 
axis which coincide with the electron energy levels, and poles on the negative real 
axis which coincide with the positron energy levels. The values e{)>m form a 
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continuous spectrum,f and the corresponding poles form two cuts in the e-plane, 
from —*© to —m and from m to ~. The segment |e|<_m contains poles which give 
the discrete energy levels. 

For the exact propagator G(e,r,r’) we can obtain a similar expansion by 
expressing it in terms of the matrix elements of Schrodinger operators; the matrix 
elements of Heisenberg w-operators are related to these by 


(m|W(t, r)|[n) = (m| y(n) [nye en (109.20) 


Here the E, are the exact energy levels (i.e. with all radiative corrections) of the 
system in the external field. The operator i increases the charge of the system by 1 
(i.e. by +le|), and ws decreases it by one. This means that in the matrix elements 
(n|W|0) and (O|ys|n) the states |n) must correspond to a charge of the system of +1, 
i.e. they can contain, besides a single positron, only a certain number of electron— 
positron pairs and a certain number of photons; the energies of these states will be 
denoted by ES. Similarly, in the matrix elements (0|W|n) and (n|W|0) the states |n) 
contain one electron and some pairs and photons (energy E%). Instead of (109.18) 
we now have 


i > (Olhi(r)|n Xn|ti@r')[0) exp{-iEY(t-t)}, t >t’, 


Gu(t — t', 4,1) = _ 
i 2 (OlYa(r)|n)<n|pi(r)|0) exp{iEy (t — t)}, t<t’, 
(109.21) 
and hence 
G..(E, r, r’) = » {2 ae ar) 0) (0 uate eld wi(r) 0) ‘} (109.22) 


Let ¢ be close to one of the discrete energy levels ES (or — ES). Then only the 
corresponding pole term need be retained in the sum (109.22). Substitution in 
(109.17) shows that the factors which depend on the second argument r’ (when 
rr’) do not appear in the equation. The result is a homogeneous integro- 
differential equation for the function (O|y(r)|n) (or (n|w(r)|0)), which we denote for 
brevity by V,,(r).4 Omitting the subscript n, we have 


[yve + iy -V— eyAP a Ve) - Mi(e, 4, ri)Vx(r1) d°x;=0 (109.23) 


(J. Schwinger, 1951). The discrete energy levels E, now appear as the eigenvalues 
of this equation. Thus (109.23) becomes the regular basis for determining these 
levels. 


+ We assume that the external field is zero at infinity. 
+ When radiative corrections are neglected, the V,(r) are the same (for states with one electron or 
positron) as the wave functions w%” or wi which are solutions of Dirac’s equation. 
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For example, it can be used to determine the correction, in the first order with 
respect to M, to the discrete electron energy level ¢, given by solving Dirac’s 
equation: 


[y°en tiy -V— eyAO(@)]Yn(r) = 0; (109.24) 


let the wave function y,(r) be normalized by the condition 
| Wn dx = 1. (109.25) 
The eigenfunction of equation (109.23) may be written 
V(r) = Ynlr) + ber), (109.26) 


where w;, is the correction to ,(r). Substituting (109.26) in (109.23), multiplying on 
the left by w,(r) and integratingt over d°x, we get the required expression 


Rise | | alr Mates, rrWard dee, (109.27) 


§110. Physical conditions for renormalization 


The theory discussed so far in this chapter has been largely formal. We have 
treated all quantities as if they were finite, and have deliberately passed over any 
infinities which occur in the theory. In the practical calculation of the functions 9, 
G and I by perturbation theory, however, divergent integrals arise, which cannot 
be assigned any definite values without further consideration. These divergences 
are a manifestation of the logical incompleteness of the existing quantum elec- 
trodynamics. It will be seen below, nevertheless, that in this theory it is possible to 
establish certain rules which allow an unambiguous “subtraction of infinities’’, 
and thus to obtain finite values for all quantities which have a direct physical 
meaning. These rules are based on obvious physical requirements that the 
photon mass is zero and the electron charge and mass are equal to their observed 
values. 

Let us first ascertain the conditions to be imposed on the photon propagator, 
and consider a scattering process which can occur through one-particle inter- 
mediate states having one virtual photon. The amplitude of such a process must 
have a pole when the square of the total 4-momentum P of the initial particles is 
equal to the squared mass of the real photon, i.e. when P* = 0; we have seen in §79 
that this requirement follows from the general condition of unitarity. The pole term 


+ In the integration we use the fact that the differential operator in (109.24) is self-conjugate, and 
thus transfer its action from W{ to tn. 
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in the amplitude arises from a diagram of the form (79.1): 


=O — Ce P (110.1) 
k=P 


and when radiative corrections are taken into account the two parts of the diagram 
must be joined by a thick broken line (the exact photon propagator). This means 
that the function (k”) must have a pole at k* = 0, i.e. must be such that 


QB —>4nZ/k? when k’?>0, (110.2) 
Z being a constant. Hence, for the polarization operator P(k’), (103.21) gives 
P(0) = 0. (110.3) 


The coefficient in (110.2) is given by 
1p.) 
Z k k?30 


Further restrictions on the function (k*) can be derived from an analysis of 
the physical definition of the particle’s electric charge: two classical (i.e. infinitely 
heavy) particles at rest at a large distance apart must interact in accordance with 
Coulomb’s law, U = e’/r. (These are distances much greater than 1/m, where m is 
the electron mass.) This interaction can also be represented by the diagram 


I. (110.4) 


in which the upper and lower lines correspond to classical particles. The photon 
self-energy corrections are taken into account in the virtual photon line. All other 
corrections, affecting the heavy-particle lines, would make the diagram equal to 
zero: the addition of any further internal lines in the diagram (110.4), for example a 
photon line joining the lines a and c or a and b, would produce lines of virtual 
heavy particles, with corresponding propagators. But the propagator of a particle 
has its mass M in the denominator, and tends to zero as M > ~. 

The form of the diagram (110.4) makes it clear (cf. §83) that the factor e?D(k’) 
in it must be (apart from the sign) the Fourier transform of the particle interaction 
potential. Since the interaction is steady, the virtual-photon frequency w = 0, and 
large distances correspond to small wave vectors k. The Fourier transform of the 
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Coulomb potential is 47e7/k?. Since @ depends only on k*= w’?—k’, we finally 
arrive at the condition 


Q—->Anlk? when k?->0, (110.5) 


i.e. the coefficient in (110.2) must be Z = 1; the sign is obvious, since Z(k’) tends to 
the free-photon propagator D(k’). The polarization operator P(k”) must therefore 
satisfy 


P(k*)/k*>0 when k’>0. (110.6) 
This leads not only to the condition (110.3) given previously but also to the result 
P'(0) = 0. (110.7) 


It has been noted in §103 that an effective external real-photon line corresponds 
to the factor (103.15) or, using (103.16) and (103.20), 


1 ni P (DO) |e. 


We now see from (110.5) and (110.6) that the correction term is zero. Thus we have 
the important result that radiative corrections need not be considered in external 
photon lines. 

The natural physical requirements therefore lead to the establishment of 
definite values (namely zero) for the quantities A(O) and #'(0). The calculation of 
these quantities from the perturbation-theory diagrams would lead to divergent 
integrals, and we see that the way to eliminate such infinities is to assign fixed 
values a priori to the divergent expressions, these values being determined by 
physical requirements. This procedure is called renormalization of the quantities 
concerned.t 

The procedure can also be formulated in a somewhat different manner. For 
instance, in renormalizing the particle charge one can define a non-physical 
intrinsic (bare or unrenormalized) charge e, as a parameter which appears in the 
expression for the original electromagnetic interaction operator in formal pertur- 
bation theory. The renormalization condition then becomes e29(k?)— 4e7/k’ 
(when k*>0), e being the actual physical charge. Hence we have the relation 
e*Z =e’, which is used to eliminate the non-physical quantity e, from formulae 
which concern observable effects. By putting immediately Z=1, the renor- 
malization is effected “‘en route’’, and there is no need to use fictitious quantities 
even in the intermediate steps. 

Let us now investigate the conditions for renormalization of the electron pro- 
pagator. To do so, we now consider a scattering process which can take place 


t The idea of this approach was first put forward by H. A. Kramers (1947); the systematic 
application of the renormalization method in quantum electrodynamics is due to Dyson, Tomonaga, 
Feynman, and Schwinger. 


490 Exact Propagators and Vertex Parts §110 


through a one-particle intermediate state with one virtual electron. The amplitude 
of such a process must have a pole when the square of the total 4-momentum P; of 
the initial particles is equal to the squared mass of the real electron, i.e. when 
P?=m_’. The pole term in the amplitude arises from a diagram of the form 


P, (110.8) 


and when radiative corrections are taken into account the thick line is the exact 
electron propagator. This means that the function 4(p) must have a pole at 
p’? =m’, i.e. its limiting form there must be 


Wp)~ Ze +g(p) when p?>m?, (110.9) 


—m’+i0 


Z, being a scalar constant and g(p) remaining finite as p*>m*. The matrix 
structure of the pole term in (110.9) (proportional to yp + m) is a consequence of 
the same unitarity condition that causes the existence of the pole. We shall prove 
this statement and at the same time elucidate the important question of the 
renormalization conditions for the external electron lines. 

If G(p) has the limiting form (110.9), the inverse matrix is 


: 1 
G\(p) = z, (yp —m)—(yp —m)g(yp —m) when p’>m*. (110.10) 
The mass operator is 
M=G"-G~(1-5-)(yp —m) + (yp — m)g(yp—m) when p?-> m’, 
1 
(110.11) 


The effective external (say incoming) electron line corresponds (cf. (103.15)) to a 
factor 


U(p) = u(p)+ G(p)M(p)u(p), (110.12) 


where u(p) is the ordinary amplitude of the electron wave function, which satisfies 
Dirac’s equation (yp —~m)u=0. Because of the requirements of relativistic in- 
variance (VU, like u, is a bispinor), the limiting value of U(p) for p?—> m’ can differ 
from that of u(p) only by a constant scalar factor: 


U(p) = Z'u(p). (110.13) 


This factor Z’ is related in a definite manner to the factor Z;, but the relation 
cannot be determined simply by substituting (110.10) and (110.11) in (110.12), 


§110 Physical Conditions for Renormalization 49] 


because there is an indeterminacy; the result depends on the order in which the 
limits of the various factors in (110.12) are taken. 

It is, however, possible to avoid the problem of the correct method of taking the 
limit, by using instead the unitarity condition for the reaction shown by the diagram 
(110.8). The unitarity relation generally applies to amplitudes of processes as a 
whole, not to individual diagrams. But when p’—> m7’ the pole diagram (110.8) gives 
the main contribution to the corresponding amplitude M;;,, so that the other 
diagrams which pertain to the same reaction can be ignored. 

As has been shown in §79, the unitary conditions require that a one-particle 
intermediate state should produce in the reaction amplitude an imaginary part with 
a delta function: 


in5(p?—m’) > M,,M#, (110.14) 


polar. 


where the subscript n refers to a state having one real electron, and the summation 
is over the latter’s polarizations; to avoid additional complications we assume, as in 
$79, that both sides of the unitarity relation are symmetrized with respect to the 
helicities of the initial and final particles, so that Mj; = Mi. The amplitude Mj, 
corresponds to a process represented by the diagram 


PB = 


Myn = (M 7, U) = Z'(M jn), 


and is 


whereM is a factor with one free bispinor index.t Similarly, the structure of the 
amplitude M;, is 


Mj, = (UM jn) = Z'GM jn). 


Summation over the polarizations of the electron replaces the product (M j,u) x 
(iM in) by M;,(yp + m)Mjp, and so the term (110.14) in the amplitude Mj, becomes 


Z"in8(p?— m2{M j,(yp + m)M et. 


Using this term in the imaginary part, we can reconstruct the entire pole term in the 
scattering amplitude; from (79.5), 


12 i it 
M; = _Z {M ¢n(-yp + mM in} p?> m*. 


p*—m’*+i0 


+ One point should be clarified here. The electron, a stable particle, cannot really be transformed 
into an assembly of real particles, but we may formally take as the latter certain fictitious particles 
whose masses are such as to allow this transformation. The resulting relationship is then to be taken as 
an analytical continuation to real masses. 
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Calculation of the same amplitude directly from the diagram (110.8) gives 
iMy: = iM jn - iG(p) - iM in 


A comparison of the two formulae confirms the limiting expression written above 
for G(p) (the first term in (110.9)), and shows that 


Z'=VZ,. (110.15) 


We shall now show that, when the limiting form of the electron propagator is 
known, there is no need to establish any further conditions for the vertex operator. 


Let us consider the diagram 
ran (110.16) 
P, Pi 


which represents the scattering of an electron in an external field A‘(k), in the first 
order with respect to the field, and taking account of all radiative corrections. In 
the limit k > 0, p2— p; =p, the self-energy corrections to the external-field line are 
zero (since they vanish for any k?=0). Then the diagram corresponds to the 
amplitude 


My = —eU(p)T(p, p; 0)U(p)- AP (k>0), (110.17) 


i.e. the product of the potential A® and the electron transition current UP"U. But 
when k >0 the potential A(x) reduces to a constant independent of coordinates 
and time. No physical field corresponds to this potential (a particular case of gauge 
invariance), which therefore can cause no change in the electron current. Thus, in 
the limit considered, the transition current %I% must be simply the free current 
aiyu: 


Up)" (p, p; 0)U(p) = Z,u(p)T*u(p) 
= u(p)y*u(p). (110.18) 


This is essentially also a definition of the physical charge on the electron. It is 
easily seen to be necessarily satisfied, whatever the value of Z,: substituting 9" '(p) 
from (110.10) in Ward’s identity (108.8), we find 


P*(p, p; 0) = z y" — y"2(p)(yp — m)— (yp — m)g(p)y", 


and (110.18) is satisfied, since (yp — m)u = 0, u(yp — m)=0. 
We see that, when the amplitude of the physical process is calculated, the 
‘renormalization constant’? Z, disappears. Moreover, by using the indeterminacy 
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that arises from divergences in the calculation of I, we can simply require that 


i(p)T"(p, p; 0)u(p) = a(p)y"u(p) when p*=m’, (110.19) 


le. put Z, = 1. 
The convenience of this definition lies in the fact that there is no need to apply 
corrections to the external electron lines: we have simply 


U(p) = u(p). 


This can also be deduced directly by noticing that for Z,; = 1 the mass operator (110.11) 
is 


M = (yp —m)g(yp — m) (110.20) 


and the second term in (110.12) obviously vanishes. Thus there is no need to 
‘‘renormalize”’ the external lines of any real particles, either photons or electrons.t 


$111. Analytical properties of photon propagators 


It is convenient to begin the study of the analytical properties of the photon 
propagator with the function II(k’). The reason is that the direct use of the 
definition (103.1) for this purpose is difficult because the operators A“(x) are 
gauge-ambiguous and their properties are therefore indeterminate. 

The integral representation of the function II(k”) (104.11) was derived from the 
expression for the photon self-energy function in terms of the matrix elements of 
the gauge-invariant current operator. Denoting the variable k’ byt t, let us consider 
the properties of the function II(t) in the complex t-plane. 

From the integral representation 


_ ptt’) dt 
me) = | AOS. (111.1) 
0 


we see that II(t) is real on the negative real axis, and elsewhere in the plane 
satisfies the symmetry relation 


II(t*) = I*(t). (111.2) 


The function II(t) can have singularities only at singular points of p(t). These 
are at values of t = k” which are threshold values for the creation of various groups 


+ In renormalizing the photon propagator, the condition Z=1 arose as a necessary physical 
condition, after which the corrections to the external photon lines disappear automatically. Formally, 
however, the situation is similar for both photon and electron external lines: when Z# 1, the wave 
amplitude e, of a real photon, with corrections, would be multiplied by V Z. 

+ Not to be confused with the symbol for the time. 


QE4 - GG 
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of real particles by a virtual photon. At these values, new types of intermediate 
states ‘“‘come into play” in the sum (104.9). Their contribution is zero below the 
threshold, but not zero above the threshold, and this causes the singularity at the 
threshold itself. These threshold values are, of course, real and non-negative.t The 
singularities of II(t) therefore also lie on the positive real t-axis. If a cut is made 
along this axis, the function II(t) is analytic throughout the cut plane. 

The term +i0 in the denominator of the integrand in (111.1) shows that we must 
pass below the pole t’=t. This means that the value of II(t) for real t must be 
taken as its value on the upper edge of the cut. Using the rule (75.18): 


=P 1 = in8(x), (111.3) 


we find that for real t 
im II(t) =im I(t + i0) = — zp(t). (111.4) 


On the lower edge of the cut, im IT has the opposite sign; re II is the same on both 
edges. Thus the discontinuity of I(t) at the cut is 


I(t + 10) — I(t — 10) = —2ip(t). (111.5) 


The integral representation (111.1) itself can in this way be regarded simply as 
Cauchy’s formula for the analytic function II(t): applying this formula 


1 f W(t’) dt’ 


2ni J t'-t oe) 
Cc 


Il(t) = 


to the contour 


©) 


(111.7) 


which passes around the cut, and assuming that II(t) decreases sufficiently rapidly 
at infinity, we find that the integral along the large circle is zero, and those along the 
edges of the cut give the following dispersion relation between II(t) and its 


+ For example, the point k? = Ois a threshold for the production of three (or a higher odd number of) real 
photons; k’ = 4m? is a threshold for electron-positron pair production, and so on. 
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imaginary part: 


I(t) = 3 joes + iO) 


dt’ 


-2 (209, im ae 5 dt (111.8) 


Substitution of (111.4) then gives (111.1).t 

The analytical properties of the functions A(t) and Z(t) are the same as those 
of II(t), in terms of which they are expressed by the simple formulae (104.2) and 
(103.21). For Z(t) we have 


a(t) = 72 (1+), (111.9) 


On the positive real t-axis, we must take t as t + i0, as shown above. The imaginary 
part of &(t) can then be calculated from (111.3) and (111.4), bearing in mind that 
II(t)/t >0 when t > 0, by (110.6). We thus find 


im D(t) = —477°6(t) + (427/t”) im T(t) 
= —477°8(t) — (47r7/t) p(t). (111.10) 


Now, applying to &(t) a dispersion relation of the form (111.8), we obtain the 
integral representation 


FC eee ey [et (111.11) 


ae t’'+ id’ 


called the Kallén—Lehmann expansion (G. Kallén, 1952; H. Lehmann, 1954). 

There is a close relationship between the position of the cut for the function 
Y(t) (and therefore its imaginary part on the cut) and the unitarity condition for the 
amplitude of the process a+b—->c+d represented by the diagram (110.4); this 
reaction is, of course, a purely imagined one, but it does not violate the con- 
servation laws, and the unitarity condition is formally valid for it. 

In the initial state i of this process there are two “classical” particles a and b, 
and in the final state there are two other such particles c and d. The unitary 
condition is (71.2)+ 


T, — Te = i(27)* D> TynT i,8(P; — Pi): (111.12) 


+ Dispersion relations were first used in quantum field theory by M. Gell-Mann, M. L. Goldberger 
and W. E. Thirring (1954). 
+ The amplitudes T; differ from the Mj, only by the factors shown in (64.10). 
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the summation on the right is taken over all the physical “intermediate” states n. In 
the present case these states are evidently those of the systems of real pairs and 
photons which can be created by the virtual photon k, 1.e. the states which occur in 
the matrix elements in the definition (104.9) of the function p(k’). The amplitudes 
M,; and M include the factors D(k’*) and B*(k’) respectively, and their difference 
contains im @(k’). We see, therefore, that the relation given by (111.4) between an 
imaginary part of @ and the existence of these intermediate states is a consequence 
of the necessary requirements of unitarity. 

We shall see later that it is convenient, in practical calculations of &(t) (or, 
equivalently, ?(t)) by perturbation theory, to begin by finding the imaginary part of 
P, which does not involve divergent expressions. But if A(t) is then calculated 
from a dispersion relation of the type (111.8), the integral diverges and further 
subtraction operations are necessary in order to Satisfy the conditions A(0) = 0 and 
P'(0) = 0. This subtraction can, however, be effected without the explicit use of 
divergent integrals. To do so, we need only apply the dispersion relation (111.8) not 
to P(t) itself but to A(t)/t’. Then we have 


1.6) 


at im A(t’) 
P(t) = | | gs dt’. (111.13) 
0 


This integral is convergent, and the function Y(t) thus obtained must necessarily 
satisfy the required conditions. A relationship such as (111.13) is called a “‘double- 
subtraction” dispersion relation. The significance of the change to A(t)/t? becomes 
especially clear if (111.13) is written in the form 


oO 


y= — | ae 2A, a+ [Ee SO at + | EO ZED apr 
aw) t'—t—i0 7 t 7 t 
0 0 0 


If the first (“non-regularized”’) integral is denoted by P(t), the right-hand side is 
P(t) — P(O) — tP'(0). 


§112. Regularization of Feynman integrals 


The physical conditions of renormalization discussed in §110 enable us, in 
principle, to derive a unique finite value for the amplitude of any electrodynamic 
process in any approximation of perturbation theory. 

Let us first of all ascertain the nature of the divergences that occur in the 
integrals derived directly from Feynman diagrams. This is considerably facilitated 
by counting the powers of the virtual 4-momenta which appear in the integrands. 

Let us consider a diagram of order n (i.e. one containing n vertices), with N, 
external electron lines and N, external photon lines; N, is even, and the electron 
lines form 4N, continuous sequences, each beginning and ending at a free end. The 
number of internal electron lines in each such sequence is one less than the number 
of vertices in it; the total number of internal electron lines in the diagram is 
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therefore n —3N,. One photon line comes to each vertex: at N, vertices the photon 
line is external, and at the other n — N, it is internal. Since each internal photon line 
joins two vertices, the total number of such lines is 4(n — N,). 

Each internal photon line is associated with a factor D(k), which contains k to 
the power —2. Each internal electron line is associated with a factor G(p), which 
contains p to the power —1 (when p’> m7’). Thus the total power of the 4-momenta 
in the denominator of the diagram is 2n —3N, — N,. 

The number of integrations (over d‘*p or d‘*k) in the diagram is equal to the 
number of internal lines minus the number (n — 1) of additional conditions imposed 
on the virtual momenta (of the n conservation laws at the vertices, one relates the 
momenta at the free ends of the diagram). Multiplying by 4, we obtain the number 
of integrations over all the 4-momentum components, 2(n — N, — N, + 2). 

Lastly, the difference r between the number of integrations and the power of 
the momenta in the denominator of the integrand is 


r=4-3N,—-N,, (112.1) 


and is independent of the order n of the diagram. 

The condition r<0O for the diagram as a whole is not in general sufficient for 
convergence of the integral: the corresponding numbers r’ for the internal sections 
which can be taken from the diagram must also be negative. The existence of 
sections having r’ > 0 would make them divergent although the other integrals in 
the diagram would converge “with something to spare’. The condition r<0 is, 
however, sufficient for the convergence of the simplest diagrams, in which n = 
N. + N, and there is only one integration over d‘p. 

If r=0, the integral always diverges. The order of divergence is at least r if r is 
even, and at least r—1 if r is odd; the decrease by one in the latter case is due to 
the vanishing of the integrals over all 4-space of products of an odd number of 
4-vectors. The order of divergence may be higher if there are internal sections with 
r’'>0. 

Since N, and N, are positive integers, we see from (112.1) that there exist only 
a few pairs of vaiues of N, and N, for which r=0. The simplest diagrams of each 
such type may be enumerated, omitting the cases N, = N, = 0 (vacuum loops) and 
N,. =0, N, =1 (mean value of the vacuum current), since they have no physical 
meaning and the corresponding diagrams must be rejected, as already shown in 
§103. The remaining cases are: 


‘\\ ra (112.2) 
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In (a) the divergence is quadratic; in all the others (r = 0 and r = 1) it is logarithmic. 
The diagram (112.2d) is the first correction to the vertex operator. It must 
satisfy the condition (110.19), which we shall here write as 


i(p)A“(p,p;0)u(p)=0 when p*=m’, (112.3) 
with 
AM =T* = yH, (112.4) 


Let A“(p2, pi; k) be the Feynman integral as derived directly from the diagram. 
This integral is logarithmically divergent, and does not itself satisfy the condition 
(112.3), but we can obtain a quantity which does satisfy this condition by taking the 
difference 


A*(p2, pi; k) = A“(p2, pis k)- [A*" (pi, Pis 0) ]p2=m?. (112.5) 


The leading divergent term in A“(p2, pi; k) is obtained by taking the virtual photon 
4-momentum f in the integrand to be arbitrarily large. It ist 


_y 2 fVoOpy*Ony df 
trie? | Pf -f Ony 


and is independent of the 4-momenta of the external lines. In the difference (112.5) 
the divergence therefore cancels, leaving a finite quantity. This operation of 
removing the divergence by subtraction is called regularization of the integral. 

It must be emphasized that the integral A“(p2, pi; k) can be regularized by one 
subtraction because here the divergence is only logarithmic, i.e. as weak as it can 
be. If the integral involved divergences of various orders, a single subtraction with 
k = 0 might be insufficient to eliminate all the divergent terms. 

When the first correction in I (i.e. the first term in the expansion of A”) has 
been determined, the first correction in the electron propagator (the diagram 
(112.2a)) can be calculated from Ward’s identity (108.8), which may also be written 
in the form 


dM(p)/dp, = A*(p, p; 9), (112.6) 


with the mass operator in place of G, and A” in place of I“. This equation is to 
be integrated with the boundary condition 


ii(p)M(p)u(p)=0 for p’=m’, (112.7) 
which follows from (110.20). 
Finally, to calculate the first term in the expansion of the polarization operator, 


we use the identity (108.14), which after contraction with respect to two pairs of 


+ The complete expression for the integral is given in (117.2). 
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indices gives 


a OP 
4a 0k,ok° 


= 27, 


a relation between the scalar functions P =3;P%4 and F = ¥%. Both these functions 
depend only on the scalar variable k?, and so we have 


2k? P"(k?) + P'(k*) = = Fk’), (112.8) 


the primes denoting differentiation with respect to k*. With the condition ¥’(0) = 0, 
this equation shows that 


S(0) = 0. (112.9) 


In the first approximation of perturbation theory, S(k’) is given by the diagram 
(112.2e), with the free ends having 4-momenta k, k, 0, 0. The corresponding Feyn- 
man integral Y(k*) diverges logarithmically, and can be regularized by a single 
subtraction, using the condition (112.9): 


S(k?) = f(k’?) — F(0). 


Then Y(k’) is found by solving equation (112.8) with the boundary conditions 
P(0) = 0, P'(0) = 0. 

In the next approximation of perturbation theory, the correction to the vertex 
operator A? is determined by the diagrams (106.10, (c)-(i)). The irreducible 
diagrams (d)-(f) are calculated by a similar regularization of the integrals, using a 
single subtraction as in (112.5), in the same way as in calculating the first- 
approximation correction A‘). In the reducible diagrams, the internal self-energy 
and vertex parts of lower order are immediately replaced by the already known 
(regularized) first-approximation quantities (AP, M, A‘), after which the integrals 
obtained are again regularized in accordance with (112.5).t The corrections ?™ and 
M® can then be calculated from (112.6) and (112.8). 

This systematic procedure will in principle enable us to derive finite values for 
P, M and A, in any approximation of perturbation theory. It thus becomes possible 
to calculate the amplitudes of physical scattering processes that are described by 
diagrams containing ?, & and A, as constituent sections. 

The physical conditions derived in §111 are therefore sufficient for an unam- 
biguous regularization of all the Feynman diagrams occurring in the theory. This 
renormalizability is a far from trivial property of quantum electrodynamics.+ 

In a practical calculation of radiative corrections, the above procedure may, 
however, not be the simplest and most rational. In Chapter XII we shall see, in 


+ In diagrams for still higher approximations, it may be necessary to replace the four-ended sections 
f¥ also by already regularized values. 

+t A different approach to renormalization theory in quantum electrodynamics is given by N. N. 
Bogoliubov and D. V. Shirkov, Introduction to the Theory of Quantized Fields, Wiley, New York, 1980. 
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particular, that a convenient treatment may start by calculating the imaginary parts 
of the corresponding quantities; these are given by integrals which do not involve 
divergences. The quantity itself is then found by analytical continuation, using the 
dispersion relations. It thus becomes possible to avoid the lengthy calculations 
which are needed in a direct regularization by subtractions. 


CHAPTER XI 


RADIATIVE CORRECTIONS 


§ 113. Calculation of the polarization operator 


LET us now go on to the actual calculation of the radiative corrections, and begin 
with that of the polarization operator (J. Schwinger, 1949; R. P. Feynman, 1949). In 
the first approximation of perturbation theory, this is given by the loop in the 


diagram 
aa ae Sate 113.1 
k a k ( ) 


As already mentioned, the problem becomes easier if we first calculate the 
imaginary part of the required function. This in turn is most conveniently done by 
using the unitarity relation. The virtual-photon lines are regarded as corresponding 
to a fictitious ‘“‘real’”’ particle, a vector boson with mass M* =k’ which interacts 
with an electron in the same way as a photon does. Then (113.1) becomes the 
diagram of a “‘real’’ process, and the unitarity condition can be justifiably applied to 
it. 

Thus we regard (113.1) as a diagram giving the amplitude of the transition of the 
boson into itself (a diagonal element of the S-matrix) via a decay into an electron- 
positron pair. The crosses in the diagram show where it is to be cut into two parts 
so as to show the intermediate state that figures in the application of the unitarity 
relation. This state contains an electron with 4-momentum p- =p and a positron 
with ps = —(p —k). 

The unitarity relation with a two-particle intermediate state (71.4), for coin- 
cident initial and final states, gives 


2im My = | IMyi|°do. (113.2) 
res E polar. 
Here the amplitude M;;, from (113.1), is 
ais 
iMy = V(4r)et- V(r e, S—, (113.3) 


where eé, is the boson polarization 4-vector, which according to (14.13) satisfies the 
equation 


e,k* = 0. 
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The amplitude M,; corresponds to the diagram for the decay of the boson into 
an electron—positron pair: 


A 
P= ~P+ 
The corresponding expression 1s 
Mn =~ eV (4rr)e,j", j* = a(p_)y"u(— ps). (113.4) 


Substitution of (113.3) and (113.4) in (113.2) gives 


2 
2e*e, im PY’ = se » | irrivete, do. (113.5) 
Aa E polar 
Here p= p_- = — p, and ¢ = e, + e_ = 2€, are the momenta and total energy of the pair 


in its centre-of-mass system; the integration is over the directions of p, and the 
summation is over the polarizations of the two particles. 

Let us now average both sides of (113.5) over the polarizations of the boson. 
This is done by means of the formula (14.15): 


Since the tensor #”” and the vector j" are transverse (A**k, = 0, j*k, =0), we have 
as the result 


gs | |p| | + 
2im P = 75- ; 7 (jj*) do, (113.6) 


where P =3P%. 


The summation over the polarizations is effected in the usual manner, the 
integration over do reduces to a multiplication by 47, and so 


2im 2 = e7 Plt y.(yp- + m)y"(yp+—m) 
8 
= — e? Pl (p.p_+.2m?), 


In terms of the variable 


t=k’=(p,+p_)’ 
= 2(m* + psp-), (113.7) 


$113 Calculation of the Polarization Operator 503 


we have 


and hence finally 


_ 2 
im P()=-tay/! am (t+2m?), t=4m?. (113.8) 


The value t = 4m’ is the threshold value for the production of one electron- 
positron pair by a virtual photon (cf. the second footnote to §111); in the 
approximation considered (~e’), the state with a single pair is the only one that 
can appear as an intermediate state in the unitarity condition (113.2). In this 
approximation, therefore, the right-hand side of (113.2) is zero for t <4m7’, and 
hence 


im P(t)=0, t<4m’. (113.9) 
For the same reason, in this approximation the cut in the complex t-plane for 


the function A(t) extends only from t = 4m’ on the real axis, and this point must be 
the lower limit in the dispersion integral (111.13). Thus 


ioe) 


O98 dt’ VEE t'+ 2m?’ 
P(t) 37 t | yop 0 i rc mae (113.10) 


4m 


It is convenient for the expression of the result to replace t by a variable é, defined 
as follows: 


tim? = —(1- €)*/é. (113.11) 


This transformation maps the upper half-plane of t on a semicircle of unit radius in 
the upper half-plane of €, as shown in Fig. 19, where corresponding line segments 
in the two planes are indicated by similar markings. The semicircle € = e*, O< @ <7, 


Fiac. 19. 


corresponds to the non-physical region 0 < t/m* <4. The right-hand and left-hand 
radii on the real axis correspond to the physical regions t <0 and t/m’ > 4. 
The integral in (113.10) is most simply calculated by means of the substitution 


l 
Ane 


t = 
m 1-x 
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and we first take the case t <0 (so that the denominator does not vanish in the 
range of integration and the imaginary term i0 may be omitted). The result of the 
integration, in terms of the variable €, is 


p= {23 (est) + (E+2-4) jos gf. (113.12) 


The analytical continuation of this formula gives the function P(t) in the range 
t >4m?’ also; this is done by putting € =|é|e'” (and the logarithm gives a con- 
tribution to the imaginary part: log é = log |é|+i7).t For the non-physical region, 
we must put € =e”, and then 


Zam mt {- 


P(t) = » sin’?3h —2+(1+2 sin’?3¢)¢ cotrd}, (113.13) 


tl4m? = sin’ 3¢. 
In the limit of small |t| (i.e. > 1), these formulae become 


a 


2 
P(t) = — aE“, It|<4m?. (113.14) 


In the opposite case of large |t| (i.e. €>0), we have 


P(th= Aa |t| log uh —t>4m’, 


(113.15) 
P(t) == t (log =5— in), pan. 


In accordance with the significance of perturbation theory, these formulae are 
valid if P/4ma <D™' = t/4z. The condition for (113.15) to be applicable is thus 


oa, ltl 
3, oS <1. (113.16) 


The radiative corrections which involve a log (|t|/m) are called logarithmic cor- 
rections. 


$114. Radiative corrections to Coulomb’s law 


Let us apply the formulae derived above to the problem of the radiative 
corrections to Coulomb’s law. These corrections may be intuitively described as 
resulting from the polarization of the vacuum around a point charge. 


+ The analytical continuation thus obtained is, as it should be, a continuation on the upper edge of 
the cut, since the semicircle in the €-plane corresponds to the upper half-plane of t. 
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If corrections are neglected, the field of a fixed centre (with charge e,) is given 
by the Coulomb scalar potential ® = Af =e,/r. The components of its three- 
dimensional Fourier expansion are 


@(k) = A®(k) = 47re,/k’. 


When the radiative corrections are included, this field is replaced by the “effective 
field”’ 
op P” 


AP = AL + Dog G— A= AP +7 - 


| og A®: (114.1) 


cf. (103.15). The second term gives the required change in the scalar potential. In 
the first approximation of perturbation theory for #(k’), we must take the expres- 
sion derived in §113 and replace B(k*) by the zero-order approximation: 


B(k?) = D(k’) = — 4a/k’. 
Thus the radiative correction to the field potential is 


Are 
(k’)’ 


5D(k) = —- P(—k’). (114.2) 


To determine the form of this correction in the coordinate representation, we 
must take the inverse Fourier transform: 


dD(r) = | e*"§@(k) d°k/(271)’. (114.3) 


Since 5®@(k) is a function of t = —k’ only, integration over angles gives 


8O(r) = 7 L, [ sap BO) a= t) 


1 . ir 
= 472, 1m i 5@(— y*) e'”y dy; 


in the last transformation, we use the fact that the integrand is an even function of 
y = V—t. The contour of integration can now be moved into the upper half-plane 
of y, and made to coincide with the cut of the function P(—y’) (Fig. 20). This cut 


Fic. 20. 


506 Radiative Corrections §114 


extends upwards along the imaginary axis from the point 2im, the physical sheet 
corresponding to the left side of the cut. Replacing y by a new variable, y = ix, we 
get 


oO 


d@(r) = —— | im 6@(x’) e-™x dx. 


2m 


Finally, on returning to the integration over t = x’, we have 


oO 


fa | im 8@(t) e~”™'! dt. (114.4) 


4m 


The imaginary part 


im $@(t) = - "ze im P(t) 


is taken from (113.8), and after an obvious change of variable we have 


oO 


Br) = 4 3(r) = 2 (142% f erm(1 4 she) MED Vr dr} (114.5) 


1 
(E. A. Uehling and R. Serber, 1935). 
The integral can be evaluated in two limiting cases. Let us first take that of 


small r (mr <1), and divide the integral of the first term in the parenthesis into two 
parts: 


ie ven) 
l= e 2mr¢ Da 


a o 


a ees ee 


1 a 


=[,+ I, 


with ¢; chosen so that 1/mr>¢,>1. Consequently, we can take r = 0 in the first 
integral, so that 


C1 
2 
n~{[¥& dé 


=log2¢,—1. 
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In I;, on the other hand, unity may be omitted from ¢?— 1: 


ie) 


=—log f,-e7"'+2mr | el log £ dé. 
1 


In the exponential and the lower limit, it is permissible to put ¢, = 0. Then, with the 
change of variable 2mr¢ = x, we have 


0 


L= log 2t, tog + | &> log x dx 
0 


= log 2¢, + log —C, 


where C = 0.577... 1s Euler’s constant. 
In the integral of the second term in (114.5) we can immediately put r = 0: 


00 7 
n~z [Yo Pac=t 
1 


When the three integrals are added, ¢, disappears, leaving 


2 [,,2¢(/,, 1 -2\| Z 
o(r) = [1 +3 (log C =) Pili. (114.6) 


When mr > 1, the range £— 1~1/mr <1 is important in the integral. The change 
of variable ¢ = 1+ € and appropriate transformations reduce it to 


foe) 


Cre | "as ; V(2€) dé _ uma Var enn 


0 


In this case, therefore,f 


r 


e Qa e” 
DO =S (1+ Rr om), > Um, (114.7) 


We see that the polarization of the vacuum alters the Coulomb field of a point 


+ The origin of the factor e~’”” in 8®(r) is evident from the form of the initial integral (114.4): when 
r is large, the important values of t are those near the lower limit. Thus the exponent is determined by 
the position of the first singularity of the function 5@(t). 
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charge in a region r~ 1/m(=h/mc), where m is the electron mass. Outside this 
region, the change in the field decreases exponentially. 

One further general comment may be made. Hitherto we have implicitly 
assumed that the radiative corrections arise from the interaction between the 
photon field and the electron—positron field. Thus, by associating the internal closed 
loops in the photon self-energy diagrams with the electrons, we have taken into 
account the interaction of the photon with the “electron vacuum’’. But the photon 
also interacts with the fields of other particles; the interaction with the ‘“‘vacua”’ of 
these fields is described by similar self-energy diagrams, in which the internal loops 
are associated with the appropriate particles. The contributions of these 
diagrams differ in order of magnitude from those of the electron diagrams by several 
factors of m./m, where m is the mass of the particle concerned and m, the electron 
mass. 

The particles whose mass is closest to that of the electron are the muons and 
the pions. The numerical ratios mm, and m,/m, are close to a. The radiative 
corrections from these particles would therefore have to be taken into account 
together with the electron corrections of higher orders. For muons, the radiative 
corrections can in principle be calculated by means of the existing theory, but for 
pions (which are strongly interacting particles) they cannot. 

This places a fundamental limitation on exact calculations of specific effects in 
present-day quantum electrodynamics. The use of arbitrarily high-order corrections 
from the photon—electron interaction alone would be an invalid exaggeration of the 
attainable accuracy. 

The radiative corrections to Coulomb’s law discussed in this section are, as we 
have seen, valid even at distances r<1/m,.. We can now add that the formulae 
obtained cease to be valid at distances r<1/m, (or 1/m,), where polarization of the 
vacua of other particles becomes significant. 


§115. Calculation of the imaginary part of the polarization 
operator from the Feynman integral 


In a direct calculation from the diagram (the loop in (113.1)), the polarization 
operator in the first approximation of perturbation theory would be given by the 
integral 


gp uy qs 
Te oe {tr y’*G(p)y’G(p ~—k) ony (115.1) 


This integral, however, taken over all four-dimensional p-space, is quadratically 
divergent, and in order to obtain a finite result it would be necessary to regularize 
the integral by the procedure described in $112. 

Here, we shall not give the complete derivation, but show how one can use the 
integral (115.1) to calculate the imaginary part of the polarization operator (which 
has been determined in 8113 by means of the unitarity condition); this derivation 
includes a number of instructive points. 

The imaginary part of the integral (115.1) does not diverge and therefore does 
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5 s . . 1: 
not need regularization. For the scalar function im ? = 31m P4 we have 


; _ . Ane’ try"(yp +m)y,(yp tyk +m) 44 
im P =im liza (p?— m?+i0[(p —kY—m? +i)” J 


After calculating the trace, we get 


i d‘p 
im P (k*) = 1m eee 02) 
ig m’+ i0)[((p — k)’— m* + iO] (115.2) 


2e° 2 2 
o(p)=3 32m + pk — p’). 
Let k’>0. We shall use a frame of reference in which k = (ko, 0), and 
(p — k)’ = (po ~ ko)’ — p’. 


Using also the notation ¢« = V(p+m’) (this is not the “energy” of the virtual 
electron po), we can write (115.2) in the form 


ie j id (Do, P) 
im P(k*) = im | d p J dpo (po— €° + i0)[(po — ko)’ — &7 + iO)’ 


(115.3) 
2e° 2 2 2 
(Po, p) = 373 (m+ €° + Poko — Po). 
The integrand has poles at four values of po: 


(a) po=e— 10, (a) po=—e+i0, 
(b) Po=ko—e +10, (b’) Po= kot+ « —i0. 


Figure 21 shows the configuration of these poles; we shall take the specific case 
ky > 0, but the final answer depends only on ké, and not on the sign of ky. We can 


ea’ 
- a” 


: : ri 
a a aR (aaa 
oh a 5" 
— sno 
FiGc. 21. 


calculate the discontinuity of the function A(t) at the cut in the complex plane of 
t = k*=k¢ or, equivalently, on the real axis in the ko-plane. The real part of P(t) is 
continuous at the cut, and the discontinuity is therefore 


AP(t) = 2i 1m A(t). (115.4) 


QE4 - HH 
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We shall first show how the position of the cut may be established from the 
form of the integral. Let I(p, ky) denote the inner integral (over dpo) in (115.3). So 
long as the upper and lower poles in Fig. 21 are at finite distances apart, the path of 
integration over po can be taken far from the poles, as shown by the broken line. It 
is therefore evident that in this case the integral I(p,ko) is unaltered by an 
infinitesimal upward or downward movement of the poles b and b’ respectively 
away from the real axis, i.e. by the change kj > ky +16, 5 > 0. Thus, where kp tends 
to its real value from above or from below, the value of I(p, ko) is the same, and 
therefore makes no contribution to the discontinuity AY. The situation is different 
only if two poles (which may be a and b when ky > 0) are exactly one beneath the 
other, so that the contour of integration is “‘trapped”’ between them and cannot be 
moved to a great distance. Thus the discontinuity A? ~ 0 only if the condition 
ko —€ =, Le. kj =2e =2V(p’+m?), can be satisfied somewhere in the region of 
integration over d°p. For this to be so, we must evidently have ky)=2m, i.e. 
t=4m?.t 

The integral I(p, ko) can be written in the form 


= id (Po, P) 4Po 
PD J) Be MDo— ko) = EF nee 


with the terms i0 omitted from the denominator and the integration contour C 
correspondingly modified, as shown in Fig. 22. We see that the discontinuity AP(t) 
is due to the impossibility of bringing the contour away from the pole a when it is 
trapped between a and b. The contour C is therefore replaced by C’, which passes 


C 2 b z - 
a b 
; : b CN a 
C : : ; 
@) °. 
Fic. 22. 


beneath a, and the integral around a small circle C” centred at a is added. Then C’ can 
always be moved away from the poles without any difficulty, and the integration along 
it therefore contributes only to the regular part of A(t). To determine the required 
discontinuity, we need only consider the integral along the circle C”, which is done by 
calculating the residue at the pole a. This calculation can be made by substituting in the 
integrand 


1 
7—3 > — 27i8(p3- &”); (115.6) 
Doe 
+ It can be shown similarly that there is no cut when t =k’ <0. Taking in this case a frame of 
reference where k = (0, k), we find that the poles of the integrand are at 
po=+(e—i0), po=+(V[(p—k)’ +m’ - i). 


The two lower poles are always in the right half-plane of po, and the two upper ones in the left 
half-plane, so that no pair can be vertically aligned. 
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the minus sign is used because the circle around the pole is traversed in the 
negative direction. In the argument of the delta function, only the zero pp=+ € is 
to be used (the pole a, not a’); this is automatically done if we agree to integrate 
over only half of momentum 4-space (py > 0). 

After the substitution (115.6), the discontinuity of the integral I(p, ko) can be 
calculated immediately: 


AI = {I(p, ko +e 16) a I(p, ko = id )}s +0 
2p | 8(p2— ©2)idb (Do, p) x 
0 


(ky — po)’ — €* +18 (ky — po)’ — &* —i8 Po- 


With the equation 


1 1 - 
Moen pe 


(see (111.3)), we have 


AI =i(2niy' | 8(3~ e°)3L(ko~ po)? ~ €°16(Po.) dbo. 
0 


The arguments of the delta functions can be rewritten in an invariant form by 
adding and subtracting p’: 


pj-€ =p —-m, (ko — po) —&° =(k—-py—m’. 
We then obtain finally 


AP(k?) = imi)? | dp - $(p)8(p?— m)3[(p —kP— m2]. (115.7) 


Because of the delta functions, the integration is actually taken only over the 
region of intersection of the hypersurfaces 


p’=m’, (p—ky=m’. (115.8) 


Since in this region all the 4-vectors p are time-like, the condition of integration 
over po> 0 is invariant (the upper interior region of the cone p’ =m’). 

Let us compare (115.7) with the original formula (115.2). We see that the 
discontinuity of the function “(t) at the cut in the t-plane can be found by applying 
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in the original Feynman integral the substitution 


in the propagators which correspond to the loop lines intersected in the diagram 
(113.1) (S. Mandelstam, 1958; R. E. Cutkosky, 1960). 

The conditions (115.8) select the region of momentum space in which the lines 
of virtual particles in the diagram correspond to real particles (the 4-momenta p 
and p-—k are then said to lie on the mass surface). Here we see clearly the 
relationship to the unitarity-relation method, where these lines are replaced by lines 
of intermediate-state real particles. 

We also observe the mathematical reason for the absence of divergence in the 
imaginary part of the diagram: the integration is over a finite region of the mass 
surface, not over the whole of infinite momentum 4-space as in the original 
Feynman integral. 

In order to derive from (115.7) the formula obtained in §113, we return to the 
frame of reference in which k = 0 and integrate over 


d‘p = |ple de dpo do. 


The integration amounts to removing the delta functions, with 
2 2 2 2 l 
5(p’— m’) dpo = 8(po— €°) dpo > 5 5(Po— €) apo, 


and then 


5[(p — k)’— m*] de = 8[(po— ko)’ — €7] de 
= 5(—2eky + ka) dé 
1 

Fk, d(e — akS) de. 


The result is 


AP(t) = — in? | v(i =a) 6(6,p) do, (115.10) 


where t = k*=ké; the value of ¢ is taken for 
2 


ee | Pi ee Gy ee) 
Po = & =2ky, po=e°—m° = 4ko-m , 


1.e. it 1S 


e? 
b(e, p) = 3-3 (2m? + t) 
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and is independent of the angle. The integration over do reduces to multiplication 
by 47, and we come back to (113.8). 

The only vital point in the foregoing derivation is that the diagram is divided 
into two parts by cutting no more than two lines. The rule stated therefore remains 
valid for diagrams comprising any two sections joined by two (electron or photon) 
lines. The integral calculated by the substitution (115.9) then gives the contribution 
to the imaginary part of the diagram that arises (in the unitarity-relation method) 
from the corresponding two-particle intermediate state. 


§ 116. Electromagnetic form factors of the electron 


Let us consider the vertex operator I = I'*(po, p;; k) in the case where the two 
electron lines are external lines and the photon line is internal. The external 
electron lines correspond to factors u;=u(p,) and u&=a(p2), and I therefore 
appears in the expression for the diagram as the product 


fi = wl *uy. (116.1) 


As already noted in $111, this is the electron transition current, including radiative 
corrections. The conditions of relativistic and gauge invariance enable us to 
establish the general matrix structure of this current. 

The electromagnetic interaction operator V =e(jA) is a true scalar (not a 
pseudoscalar), in accordance with the conservation of spatial parity in these 
interactions. The transition current j;, is therefore a true 4-vector (not a pseudo- 
vector), and hence can be expressed only in terms of other true 4-vectors formed 
from the two available 4-vectors p; and p> (k = p2— p; 1s a third) and the bispinors 
u, and uw. There are three independent 4-vectors of this kind bilinear in # and uy: 


Uryuy, (U2U)) Pi, (U2) po, 
or, equivalently, 


Uyyu, (HU) P, (U2u,)k, (116.2) 


where P = p,+ p>. The condition of gauge invariance requires that the transition 
current should be transverse to the photon 4-momentum k: 


ink = 0. (116.3) 


This is satisfied by the first two 4-vectors (116.2), respectively because of Dirac’s 
equations 


(ypi-m)u,=0, tlyp2—m)=0 (116.4) 


and tecause Pk =0. The current J; 1s given by a linear combination of these two 
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4-vectors: 


Vii = filo) P* + faliny”uy), 


where f, and f, are invariant functions, called the electromagnetic form factors of 
the electron. 

Since the 4-momenta p,; and p, relate to the free electron, pj=p3=m’, and 
from the three 4-vectors p,, p2 and k (which are related by k = p.—p;) we can 
construct only one independent scalar variable, which we take as k”. Then the form 
factors are functions of k’. 

The expression for the current can also be put in other forms with different 
choices of the two independent terms. Using the equations (116.4) and the com- 
mutation rules for the matrices y, we can easily show that 


2 


(zou) )k, = — 2m(iry"U)) + (d2uy)P*, (116.5) 


where o"” = 3(y"y’ — y’y"). The coefficient of this term will later be seen to have 
an important physical significance, and we therefore write 


b= yf(K?)— g(o"k, (116.6) 


where f and g are two other form factors; the reason for writing the factor 1/2m 
separately will be explained below.t For brevity, we shall always use the vertex 
operator instead of the current, the # and u, on either side being understood. 

In order to determine the properties of the form factors, let us consider the 
diagram (110.16) for the interaction of an electron with an external field. The 
corresponding scattering amplitude is 


My = — effi (Kk), (116.7) 


where ‘) is the effective external field (taking account of the polarization of the 
vacuum). 

The amplitude (116.7) describes two reaction channels. In the scattering chan- 
nel, the invariant t is such that 


t = k* = (p,— pi) <0. 


Putting p_ for p» and — p, for p;, we change to the annihilation channel, which 
corresponds to pair production with 4-momenta p-_ and p,. In this channel, 


t=(p_+p.y=4m’. 
The range 0 < t <4m7’ is non-physical. 


+ To avoid misunderstanding, it may be mentioned that in the definition (116.6) k is assumed to be 
the 4-momentum of the photon line coming to the vertex; for the outgoing line, the sign of the second 
term would be reversed. 
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Let us now consider the unitarity condition (111.12). In the scattering channel 
(t <0) there are no physical intermediate states in this case: one free electron 
cannot change its momentum or give rise to any other particles. There are also, of 
course, no intermediate states in the non-physical region. Hence, when t < 4m’, the 
right-hand side of (111.12) is zero, so that the matrix T;; (or, equivalently, My;) is 
Hermitian: 


My = a 


The interchange of the initial and final states corresponds to the interchange of p2 
and p,, and therefore to a reversal of the sign of k. Putting M,; in the form (116.7), 
we therefore have 


[isd Ck) = Fi Fd *(— k). 


Since #°(—k) = 4©*(k), it follows that the transition-current matrix is also 
Hermitian: 


jn =i; when t<4m’. (116.8) 


Using the properties of the matrices y (21.7), we can easily prove that 


(iny"u) = (hyy"u2)*, 


(i,0"" Uy) Se (ujo""ur)*. 


Thus j*, differs from j,;, only in that the functions f(t) and g(t) are replaced by their 
complex conjugates, and it then follows from (116.8) that these functions are real. 
Thus 


im f(t)=im g(t)=0 when t<4m’. (116.9) 


In the annihilation channel (t >4m’), the f state is a pair which can be 
transformed into another pair with different momenta (elastic scattering) or into a 
more complex system. The right-hand side of the unitarity condition is therefore 
not zero, the matrix M;; (and hence j;;) is not Hermitian, and so the form factors are 
complex. 

The analytical properties of the functions f(t) and g(t) are exactly similar to 
those of the function ¥(t) discussed in $111, although it is difficult to prove them so 
directly. These functions are analytic in the complex t-plane cut along the positive real 
axis t > 4m’, with 


f*(H=f(t*), g*(t)= a(t"). 


The renormalization condition (110.19) applied to. the vertex operator (116.6) 
leads to the requirement 


f(O) = 1. (116.10) 
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In order to include this condition automatically (when calculating f(t) from its 
imaginary part), we must apply a dispersion relation of the form (111.8) not to the 
function f(t) itself but to (f — 1)/t. Then we get the dispersion relation ‘‘with one 
subtraction”: 


fee) 


fty-1=4 | Tp (116.11) 


4m 


No values for the form factor g(t) are prescribed by physical conditions. Its 
dispersion relation will therefore be written “‘without subtractions”’: 


g(t) = Lf ree 5 at’ (116.12) 


The value of g(0) has an important physical significance, as it specifies the 
correction to the magnetic moment of the electron. In order to see this, let us 
consider the scattering of a non-relativistic electron in a constant magnetic field 
which is almost uniform in space. 

The term in the scattering amplitude (116.7) which depends on the form factor 


g(k*) is 
5M; = am 9(k*)\(Go"'u)k,A®(k). (116.13) 


For a purely magnetic field, A” = (0, A); since the field is constant in time, the 
4-vector k* = (0,k), and since it varies only slowly in space, k is small. With a view 
to subsequently taking the limit k >0, we have already replaced the effective # 
by A® in (116.13). Expanding (116.13) and expressing it in terms of three- 
dimensional quantities, we find 


5M; = a g(—k’)(a2%uy)ik X Ay, 


where & is the matrix (21.21). The product ik X A, is replaced by the magnetic field 
H,, and we can then take the limit k>0. Finally, with the non-relativistic spinor 
amplitudes w1, w2 given by (23.12): 

il, = V (2m)(w40), 

ui=V2m)("'), 


we have 


5M; = 5 (0) H, -2m(w¥ow)). (116.14) 
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This expression may be compared with the scattering amplitude in a constant 
electric field having the scalar potential ®,: 


Mg a (iy u)) ®, re eD, : 2m(ww)). 


We see that an electron in a magnetic field can be regarded as having an additional 
potential energy 


é 
= Onn go > Hy. 


This means that the electron has an ‘“anomalous”’ magnetic moment (in ordinary 
units) 


wu’ = (eh/2mc)g(0) (116.15) 


in addition to its ‘“‘normal’’ Dirac magnetic moment eh/2mc. 


§117. Calculation of electron form factors 


Let us now go on to the actual calculation of the electron form factors (J. 
Schwinger, 1949). In the zero-order approximation of perturbation theory, the 
vertex operator [* = y", 1.e. the electron form factors are f =1, g =0. The first 
radiative correction to the form factors is given by the vertex diagram 


(117.1) 


with two real external electron lines and one virtual external photon line. We shall 
first calculate the imaginary parts of the form factors. As has been shown in $116, 
these differ from zero only in the annihilation channel (k’ > 4m’); accordingly, the 
4-momenta of the external electron lines in the diagram (117.1) correspond to the 
production of an electron and a positron, and are denoted by p- and — p.. The 
analytical expression of the diagram (117.1) is 


= iew(p\P*u(— p.) = (~ ie) a(pdy't | GD)Y"G@ — Ky Dul) ao u(— ps) 
(117.2) 
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or, in expanded form, 


a ae er ae ib"(p) dp 
Y'F(K) — 552 80k, = | me — PR (117.3) 


with the notation 


b"(p) = —e2 LLP ee (117.4) 


and omitting for brevity the factors u(p_), u(—p+); 1t 1s understood that both sides of 
the equations below are placed between these factors. 

The horizontal dotted line divides the diagram (117.1) into two parts, in such a 
way as to indicate the intermediate state which would figure in a calculation of the 
imaginary part of the form factor by means of the unitarity condition, namely the 
state of an electron—positron pair with momenta differing from p_, p+. The 
intersection also shows where the pole factors are to be replaced in the integral 
(117.2) in order to use the rule (115.9) in the calculation (in (117.3), these factors 
have been separated in the integrand). 

The integral in (117.3) has the same form as that in (115.2), and we can therefore 
immediately write the result of the transformation as in (115.10): 


t—4m? 
t 


2y" im f(t) oom o”’k, im g(t) = ~4n? i} b"(p) doy, (117.5) 


where t = k’, the integration is over the directions of the vector p, and the 4-vectors 
p!=p and p:=k—p in the definition of the function @"(p) (117.4) become the 
4-momenta of real (not virtual) particles. The expression (117.5) relates to a frame 
of reference in which k = 0, 1.e. the centre-of-mass system of the pair p_, p+ (and 
hence of the “intermediate” pair p', p:). In this frame, therefore, k = (ko, 9), 
p_= (ko, p_), p+ = Gko, —p_), p = Gko, p), and it is easy to verify that 


f? =(p — p_-Y = — 2p’ — cos 6) = — 3(t — 4m*)(1 — cos 8), (117.6) 


where @ is the angle between p and p_ (and p’= p2). Not substituting (117.4) in 
(117.5) and eliminating the matrices y”... y, in the integrand by means of (22.6), we 
have 


y" im f(t) aon o"’k, im g(t) 


e? 


i Eig ale Ata dope» H 25 


aa e? do; 7 a ‘. ii 
= eo er gy b~ 2mty" + 4m(P* + 2f*)+ 
+ 2(yp+— yf)y"(yp- + yf)1, (117.7) 
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with the 4-vectors 


f=p-—p-=(0,f), P =p-~p+=(, 2p). (117.8) 


The integration now amounts to the calculation of the three integrals with 
different numerators 


Lt do; 
wh Tuy ai 7.9 


The integral I is logarithmically divergent when 6 — 0. If it is written as 


t—4m2 —(t—4m?) 


I= | d(f)/f = df yf’, 


0 


we see that the divergence corresponds to small ‘“‘masses”’ of the virtual photon, i.e. 
it is an “infra-red”’ divergence, which will be further discussed in 8122; here we 
shall merely note that the divergence is fictitious, in the sense that, when all 
physical effects are correctly taken into account, the divergences of this kind 
cancel out. We can therefore cut the integral off at any lower limit, and then, in the 
subsequent calculation of real physical phenomena, make this limit tend to zero. 

Here it will be simplest to apply the cut-off in a relativistically invariant manner. 
To do so, we assign to the virtual photon f a small but finite mass A («m), i.e. make the 
change 


pPof—w (117.10) 
in the photon propagator D(f’) in (117.2). Then 
—(t—4m2) 


2 no Meee? 
I= | ty = log SS", (117.11) 


0 


The integral I”, in which f" is a space-like 4-vector, must be expressed in terms 
of the 4-vector P“, which (unlike k”, the only other available 4-vector) is space-like 
for all p, and p_. Hence I” = AP”. If this equation is multiplied by P, and the 
integral P,I* is calculated in the centre-of-mass system of the pair (with the 
components of the 4-vectors f and P given by (117.8)), the result is 
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Thus 
T+¥=-—P?, (117.12) 


["" = $P?(ge — “5 -) + EPP”; (117.13) 


to determine the coefficients in this expression, we have only to evaluate the 
integrals I and [”’P,,P,,. 

The calculation continues as follows: (117.11)-(117.13) are substituted in (117.7) 
to obtain a series of terms between u(p_) and u(—p;). In each term we use the 
commutation rules for the matrices y” to “propel” the factor yp, to the right and 
yp- to the left; then we can make the replacements yp_—m, yp.—— m, since 


u(p-)yp-= miu(p-), yp+u(— p+) = — mu(— p,). 
In the resulting sum 
—4(p.p_)Iy" +2mP*" —3P’y", 


we can then replace P” by 2my" +o"’k,, which is equivalent to it when between 
the u and u factors; cf. (116.5). Finally, all quantities are expressed in terms of the 
invariant t =k? (2p.p-=t—2m’, P?=4m’*-—t) and the two sides of (117.7) are 
compared; this yields the following expressions for the imaginary parts of the form 
factors: 


7 am? 
ig) = Fae aa (117.14) 
t—4m?’ 


im f(t) =] Foam | ~3t + 8m?+2C¢ — 2m?) log = : (117.15) 


The infra-red divergence occurs only in im f(t). 

The functions f(t) and g(t) themselves are obtained from their imaginary parts 
by means of (116.11) and (116.12), in which the integrations are effected by the 
same substitutions as were used in §113 to calculate A(t). The form factors are 
given in terms of the variable é (113.11) by 


g(é) = a eee (117.16) 
_,o 1+ é& m _3(1+ é?)+2é 
f(é) 1 = 34 {2(1 +7 S5log €) log MIE log € + 
er (br? —}log® €—2F(€) +2 log é log (1+ 81}, (117.17) 


where F(€) is Spence’s function (131.19). 
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In the non-physical region (0 < t/m? <4), we must put é = e’*. The expressions 
for the form factors can then be written 


o/2 
_af(,__¢ m ,3cos¢+1 2 | 
J($) + 4(1 ae) eet 2 sin d OF a 
0 
(117.18) 
(6) =o (117.19) 
é 27 sin db 
Finally, the following are the limiting expressions: for small |t], 
ft-1l= zoo (log " -3), |t|<4m’, g(t)=a/2r7, (117.20) 


and for large |t], 


fy—1=- (5 log’ M2 1og™ tog Hh) + ae log (2°, eee 


—-t>4m’, 
(117.21) 
_ am’, |tl | fiam’/t, t>4m’, 
g(t) = “ar 08 + 0, as Pecan (117.22) 


Formula (117.21) is valid (as regards ref) with what is called double-logarithmic 
accuracy, l.e. as far as the squares of the large logarithms.t 


§ 118. Anomalous magnetic moment of the electron 


As has been shown in $116, the value of g(0) determines the radiative correction 
to the magnetic moment of the electron. If we seek to calculate only this quantity, 
there is of course no need to find the whole function g(t). From (117.14) and 
(116.12), 


_i f im g(t’) =| 
g(0)= a ; = dt' = rr Syecp = = = (118.1) 
With this correction, the magnetic moment of the electron 1s 


en a 
" =a (1+350), (118.2) 


a formula first derived by Schwinger (1949). 


+ The expression for the vertex operator in the case of one virtual and one real external electron line 
and a real external photon line is given by A. I. Akhiezer and V. B. Berestetskii, Quantum Elec- 
trodynamics, Interscience, New York, 1965, $47.5. 
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In the next approximation (with terms in a’), the radiative corrections in the 
form factors are represented by the seven diagrams (106.10, (c)-(i)). Even to find 
the value of g(0) in this approximation demands very lengthy calculations. Details 
of these may be found in the original papers; here we shall give only the final 
valuet for the correction in the second approximation: 


197 wm | 3 
g(0) = Oe = es 
(0) = (* ) (Bi+s 57 log 2+5 2) 
= — 0.328 a’/a’. (118.3) 


The magnetic moment of the electron is therefore 


— ae (1+ — 0.328" “) (118.4) 


(C. M. Sommerfield, 1957; A. Petermann, 1957). 
Particular consideration may be given to the contribution of the vacuum 
polarization to the correction g®(0), namely the diagram 


(118.5) 


oO; 


which contains a photon self-energy part. It differs from the first-approximation 
diagram (117.1) only by having instead of the photon propagator D(f*) = 47/f? the 
product 


P An P(f? 
pg) 2D pg) = 48 PO. 
ff 
where Y(f’) is the polarization operator in the first approximation (terms in a), 
calculated in §113. Repeating, with this difference, some of the calculations in $117, we 
find as the “polarization part” of the correction 


2 
im g(t) => ES SEAN eee (118.6) 
Vitt — 2 
with 
f? = —4(t —4m’)(1 — cos 6); (118.7) 


+ The calculation by the unitarity method is given by M. V. Terent’ev, Soviet Physics JETP 16, 444, 
1963. 
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see (117.6). When this integral and then 


ie) 


BuO) =— | im gQan(t’) F (118.8) 


4m2 


are calculated, the result is 


2 p 
g2a.(0)= 23 (Ge—3n") = 0.016 5, (118.9) 
which is about 5% of the total quantity (118.3). 

It has already been noted at the end of §114 that vacuum polarization for other 
particles may also make a contribution to the radiative corrections. The con- 
tribution of the muon vacuum to the anomalous magnetic moment of the electron is 
obtained from the same formulae (118.6)-(118.8), in which m is again the electron 
mass m, (this applies also to the definition of the variable f’) but the parameter m 
in the expression for A(f*) must be the muon mass m,. The function P(f’)/f? 
depends only on the ratio f’?/m?,. In the integral (118.8) the important values of ft 
(and therefore of f*) are those comparable with m2; hence the ratio f?/m2 ~ 
(m./m,) <1, and in evaluating the integrals we can use the limiting formula 
(113.14), according to which 


From this we see that the contribution to g7(0) from the muon vacuum polarization 
contains the extra small factor (m,/m,)’. 

The opposite situation, however, occurs for the corrections to the magnetic 
moment of the muon. Since the particle mass does not appear in (118.3), this value 
of g7(0) is valid for the muon also, and it takes into account the contribution of the 
muon vacuum polarization. But the contribution from the vacuum polarization of 
other particles, namely the electrons, is here considerably greater. It can be 
calculated from formulae (118.6)-(118.8), with m, for m and the electron polariza- 
tion operator for A(t). Unlike the previous case, the important range of values is 
now given by f*/m?~(m,/m.)’>1, and P(f*?) must be taken as the limiting 


expression (113.15): 
Pf) a, lf? 
f = = log i. 


The calculation of the integrals gives 


2 
é _ (2) (1), Me _ =) 
[g (0) Jetectr. polar. (=) (; log m, 36 


(H. Suura and E. H. Wichmann, 1957; A. Petermann, 1957). 
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Adding (118.10) and (118.3), we find as the magnetic moment of the muon 


_ eh a 
mon = Fara (I+ 9 +0.76 25 “) (118.11) 


The contribution of the muon vacuum polarization (118.9) is here about 2% of the 
total value of g?(0). The pion vacuum polarization would give a contribution of the 
same order (because of the similarity of the masses), but this cannot be calculated 
exactly, and so there would be no point in finding the corrections ~a?’ in the 
magnetic moment of the muon. 


§ 119. Calculation of the mass operator 
The method of direct regularization of the Feynman integrals may be demon- 


strated by a calculation of the mass operator. In the first non-vanishing ap- 
proximation, the mass operator is represented by the loop in the diagram 


=O (119.1) 
k 


corresponding to the integral 


- ip) = (ie [ "G0 - Wy Dl Ss 


on substituting the propagators and combining the factors y”...y, by means of 
formulae (22.6), we get 


-,. 8m 4 2m — yp + yk 4 
M(p) = (2a) © (ok dea k (119.2) 
(the bar over .4 denotes the non-regularized value of the integral). The fictitious 
‘photon mass’ A is used in the photon propagator in order to eliminate the 
infra-red divergence (as in §117). 

The integral may be transformed by means of formula (131.4), with a, and a> 
the two factors in the denominator of (119.2). A simple rearrangement of the terms 
in the denominator of the new integral gives 


—~. + 8m 4 2m — yp + yk_ 
Wp) =- GAGE ac ‘fo ere (119.3) 


with 
a’ = m’x?—(p*?— m’)x(1—x)+A7(1—x). (119.4) 
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The change of variable k >k + px brings the integrand in (119.3) to a form in 
which its denominator depends only on k?; according to (131.17), (131.18) a 
constant is then added to the integral: 


NGi== Baie {| te fas ens a en *)_ 1 inp |. (119.5) 


(The term in yk in the numerator is here omitted, since it gives zero on integration 
over the directions of the 4-vector k; cf. (131.8).) 

The regularization of this integral involves subtractions such as to reduce it to 
the form (110.20). The latter expression gives zero on multiplication by the wave 
amplitude u(p), if p is the 4-momentum of a real electron. Without bringing in u(p) 
explicitly, we can formulate this condition as stating that “(p) vanishes when we 
substitute 


yp>m, p’>m’. (119.6) 


The form of the integral (119.5) is convenient in that the 4-vector p appears in it 
only as yp and p’, but not as kp. 

Subtracting from (119.5) a similar expression after the substitution (119.6), we 
get 


ear | dsm — wea eae 
- [ore forge a3 OP ~ m)— Lin(yp — m)}, (119.7) 


with ag = m’x?+A7(1—x). 

To complete the regularization, however, a further subtraction is needed: 
according to (110.20), the substitution (119.6) should reduce to zero not only (p) 
itself but also “(p) without the factor yp—m. A corresponding subtraction 
removes entirely the second and third terms in the braces in (119.7).f The first 
integral is transformed by incorporating a further integration (using (131.5)), taking 
n=2,a=k?—a’, and b = k*— ad. Then (119.7) becomes 


i 1 
167i (yp + m)[2m — yp( — x)]x(1 — x) 
Op ™) Ony y* * fate {dx [az [k?— a+ (p*— m*)x(1— x)z] 


+ Thus, in the process of renormalization “en route” ($110) we omit corrections to the renor- 
malization constant Z|. The corresponding integrals are logarithmically divergent. If we use a “cut-off 
parameter” A’> m7’, p’, and limit the region of integration over d*k by the condition k? <A’, the 
correction can be explicitly calculated; the result is 


ZeveZy.. Zo2=— 1 BS ot r” 9 119.7 
a Dae [28 mt OF m2” 4} cae 


QE4 - Il 
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where we have also used the identity p>— m* = (yp — m)(yp + m). The integration 
over d‘k is immediate: assuming that p’— m?<0 and using (131.14), we have 


> ae : (yp + m)[2m — yp(1 — x)]x(1 — x) 
(yp me | & | ae m’*x?+A*(1—x)+(m?— p”?)x —x)z’ 
0 


We now need to subtract a similar integral with the substitution (119.6), omitting 
temporarily the factor yp — m; after some simple calculations, we get 


1 1 
2 
Mon 2p —mys- | dx | dz x 
0 0 


m(1—x°*)—(yp +m)(1- x1 -=5 Oat 


Xt (m—pl-xyz os 


(the term in A”? is omitted from the common denominator, since this causes no 
divergence in the present case; elsewhere, A7(1— x) is replaced by \”, since the 
infra-red divergence will correspond to the divergence as x > 0). 

The integration in (119.8) (first over dz and then over dx) is fairly lengthy but 
elementary, and the result is 


ee eee mee eee = 
M(p) = 52,5 (yp ~ m)'{aq— (1-7 P og 0 


wat apg or Pte toep) +1210] 
mp 1 —p) \" aes logp )+1+2log——|7, (119.9) 


with 
p =(m’— p*)/m? 


(R. Karplus and N. M. Kroll, 1950). The integral has been calculated on the 
assumption that p >0 and p>Aj/m. In accordance with the rule for passing round 
poles, in the analytical continuation of (119.9) into the region p <0 the phase of the 
logarithm is found by making m > m — i0; then p—> p — i0, and log p must be taken 
for p <0 as 


log p =log|p|—iz, p<0. (119.10) 


Let us now consider how the mass operator behaves when p*>m?’. Then 
— p ~ p’/m’>1, and with logarithmic accuracy we have 


M(p) = —[S"'(p)- G""(p)] 


2 
ache Pp 
ri (yp) log me (119.11) 
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As in the case of the photon propagator (cf. formulae (113.15) and (113.16) for the 
polarization operator), the correction to G™' is small only when the energy is so 
small that 


In the present case, however, the logarithmic increase is in a certain sense 
fictitious, and can be eliminated by a suitable choice of gauge, i.e. of the function 
D in the photon propagator (L. D. Landau, A. A. Abrikosov and I. M. Khalat- 
nikov, 1954). To achieve this, we put (in the notation of §103) 


DY =0, (119.12) 


whereas formula (119.9) has been derived with the gauge 
Do =>), (119.13) 


This property of the gauge (119.12) makes it especially suitable for investigating the 
theory when p’*> m7’, as we shall do in §132. 

To prove the result stated, we note that, if only terms in e’ are concerned, the 
transformation from the gauge (119.13) to (119.12) can be regarded as infinitesimal. 
Accordingly, we can apply immediately the formula (105.14), with 


d®(q) = — Diq? = — 4al(q’y’, 


and also replace @ in the integrand by G with the necessary accuracy. In the 
integral over d‘q, the important range is gq > p, for which G(p — q) in the integrand 
is much less than G(p) and can be neglected. Then 


5G°-'=-—G~*(p)8G(p) 
= —ie’G(p) | d(q) d*q/(277)’. 


Finally, with the transformation (131.11) and (131.12), we have 


59-Xp)=-F- Gp) | TP 


2 Z 


where A is an upper limit, at which the divergence is removed by renormalization; 
the renormalization consists in subtracting the same expression with p’~m_?’, 
giving finally 

ie 


6g '= yp nee 
Aa m~ 


This just cancels the difference @'— G" in (119.11). 
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Lastly, let us consider why it is necessary to use a finite “photon mass” A in 
regularizing the integral (119.2), which is closely related to its behaviour when 
p’-m_’. Firstly, this integral itself is finite when p* = m’ and A = 0; to exclude the 
divergence for large k, which is here unimportant, we assume that the integral is 
taken over a large but finite region of k-space. The need to use A arises in the 
subtraction of the renormalization integral, which would otherwise diverge at 
p’ =m’. Let us therefore ascertain how the non-regularized mass operator would 
behave as p’> m7’. Since this behaviour depends essentially on the choice of gauge, 
we shall consider the general case of an arbitrary gauge, whereas the integral 
(119.2) has been written for the specific gauge (119.13). 

We again apply the transformation (105.14). Writing 


dq) = pili 


=a ia Sa(q?), (119.14) 


we assume that 5a is the variation of a function a(q’) which changes appreciably 
only over intervals q’ ~~ m’ and is finite when q’ ~ m’. In the integrand on the right 
of (105.14), the two terms in the difference 4(p) — G(p — q) are almost equal when q 
is small, and the integral converges. For small q, 


1 
G(p - q)~ = ——__., 
(p — 4) p’—m’-—2pq 


and so 4(p — q) may be neglected in comparison with G(p) when gq > (p*— m’)/m. 
The integral 


55(p) = ie(p) | a% a) 5 


= - £90) [ saca) “CP q) 


diverges logarithmically in the range 


(p*—m’y ~m) <q’ <m’. 
m? 


We therefore have, with logarithmic accuracy, 


5G? ; m? 
Gd, da(m VOB 2 ne 


This can be integrated as follows. When a = e* 0, the exact propagator 4 must 
be the same as the free-particle propagator G, and therefore 


l ( m2 een 


gs ae oer pe—m- , (119.15) 
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where ay = a(m’) and C is a constant. To determine C, we compare the expression 
G\(p) = (yp ~ m)| 1+ (C- av) loz p |, (119.16) 


which is obtained from (119.15) in the first approximation with respect to a, and the 
corresponding expression given by the integral (119.2) when A = 0:T 


G'(p) =(yp -m)|1+= log |. (119.17) 


According to the definition (119.14), the function a(q’) is equal to the ratio D/D. 
The gauge (119.13) to which (119.17) belongs therefore corresponds to a = a) = 1. 
Equating (119.16) and (119.17) for this value of ao, we find C = 3. 

Thus we have finally as the limiting (infra-red asymptotic) expression for the 
unrenormalized electron propagator with p?—> m? 


+m m2 a(3—ag)/27 
9(p) = Bo (ot) (119.18) 


(A. A. Abrikosov, 1955). The validity of this formula depends only on the 
inequalities a <1, |log p|>1, whereas the formulae of perturbation theory would 
require also that al log p|/2a7 «1. The sign of the difference p’— m7’ is also unim- 
portant here, since the imaginary part of (119.18) is in any case beyond the limits of 
its accuracy. 

The renormalized propagator must have a simple pole when p’= m7’. We see 
that (119.18) satisfies this condition only in the gauge where 


D” =3D (119.19) 


(so that ay=3). Then the regularization of the Feynman integral (in order to 
prevent its divergence at the upper limits) will not require the use of a finite 
‘photon mass’’. In other gauges, the zero mass of the photon produces a branch 
point instead of a simple pole at p? = m7’, and the finite parameter A is needed in 
order to remove this “defect”’. 


§ 120. Emission of soft photons with non-zero mass 


In calculating the electron form factors in §117, we encountered a divergence of 
the integrals when the frequencies of the virtual photons are small. This divergence 
is closely related to the infra-red catastrophe discussed in §98, where it was pointed 
out that the cross-section for any process involving charged particles (including the 


+ It is not necessary to repeat the calculations in order to derive (119.17). The term in log p in (119.9) 
is obtained on the assumption that p > A, which allows the limit A > 0 to be taken. The term in log (A/m) 
arises from the subtraction of the renormalization integral, and is not present in the original integral 
(119.2). The subtraction is easily seen to have no effect on the log p terms. 
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scattering of electrons by an external field, represented by a diagram such as 
(117.1)) has no significance in itself, but only when the simultaneous emission of 
any number of soft photons is taken into account. It will be shown in $122 that all 
the divergences cancel in the total cross-section, which includes the emission of 
soft quanta. Here, of course, in order to obtain the correct result it is necessary for 
the initial “cut-off”? of the divergent integrals to be taken in the same manner in all 
the cross-sections in the sum. 

In §117, this cut-off was applied by means of a fictitious finite mass A of the 
virtual photon. We must therefore now modify the formulae of §98 in such a way 
that they describe the emission of soft “photons” with non-zero mass. 

Formally, such a photon is a “‘vector’’ particle with spin 1, whose free field has 
been discussed in §14. Such particles are described by a 4-vector W-operator 


b. = V (4a) > Tio) (Cxa 0? e ™ + Ef, e@* e™), a =1,2,3; (120.1) 
k, a 


the notation and normalization differ from that in (14.16), in order to bring them 
into line with the photon case. 

The interaction of the “‘photons”’ (120.1) with electrons is to be described by a 
Lagrangian of the same form as for true photons: 


— ef", (120.2) 


the potential A, being replaced by Wy. Then the amplitudes for the processes of 
emission of photons with finite mass will be given by the usual rules of the diagram 
technique, the only differences being that 


=X (120.3) 


and that the summation over the polarizations of the emitted photon must be taken 
over three independent polarizations (two transverse and one longitudinal) instead 
of two as for the ordinary photon.This is equivalent to averaging with respect to 
the density matrix of unpolarized particles 


kk, 
Puv = — (8. _ | (120.4) 


(cf. (14.15)), followed by multiplication by 3. 
The propagator for “photons with non-zero mass’’ is 


4 kk, 
Duo = Gray (Be — A") 


(cf. (76.18)), but in the case of gauge invariance the amplitudes of real scattering 
processes do not depend on the longitudinal part of the photon propagator, and this 
property is not the result of the specific form of its transverse part. The second 
term in the parentheses can therefore be omitted, leaving an expression of the same 
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type as for ordinary photons: 
At 
D,, = ke k?— ? Luvs (120.5) 


as has been used in 88117 and 119. 

Let us now consider the emission of soft photons (in the sense explained in 
$98). The derivation of (98.5) and (98.6) can be applied to the present case, the only 
difference being that the term k’ = A’ is added in expanding the squares (p +k)’ in 
the denominators of the electron propagators. We thus have, instead of (98.6), 


_ pe > d’k 
MOOG ae + oa pk—A2/2| 470’ 

where do,; is the cross-section for the same process without emission of a soft 
quantum, which we shall conventionally call an “elastic process’. In the in- 
tegrations over d°’k, the important range is |k|~ A. Then p’k ~ pk > A’, so that the 
terms in \’ in the denominators may be neglected. The summation over polariza- 
tions of the photon is carried out by means of (120.4), as already described. When 
the approximation stated is used, the second term in (120.4) makes no contribution 
to the cross-section, and there remainst 


! 2 3 
= of Pp’ _ ip dk 
| ne ee Gas aa) ot. (120.6) 


We thus recover formula (98.7), but w must now be taken as 
w = V(k?+ A’). (120.7) 


Formula (120.6) is completely general: it is applicable to both elastic and 
inelastic scattering and even when the type of particle changes. The result of the 
further integration over d°*k depends on the 4-vectors p and p’, i.e. on the nature of 
the basic scattering process. 

Let us take the case of elastic scattering, for which 


Ipl=|p'l, e=e', 


and determine the total probability of photon emission with a frequency less than 
SOME Wmax, With the assumption that 


X < wmax (120.8) 


and that @max 1S subject to an upper limit governed by the conditions (98.9) and 
(98.10) for the theory of soft-photon emission to be valid. We first calculate the 


+ There may be some doubt at first as to the validity of neglecting A” before averaging, since it 
occurs in the denominator of the second term in (120.4), but we can easily show directly that this term 
gives, on averaging, a contribution ~ A‘: 1/\’*, which is negligible. 
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integral over d°*k in the non-relativistic limit. For |p| = |p'| <m, 


(2. 7B.) = (ase q 
(p'k) (pk) 2.4 Des 


mé@ mW 


where q = p’ — p. Integration over the directions of k gives 


4nq’ (Kk 
ma’ ar ; 
Then, from (120.6), 


i 2 2 
_ eg __k k* d\k 
Co ae a? | ! 3(k? + A?) Ceetie 
0 
or, after integration with the assumption that wmax/A > 1, 
r 6 


2 
do = doe: 24 (log “eon 2), q?<m’. (120.9) 


In the general relativistic case, the integral 1s calculated by means of (131.4). 
The angle integral is then 


dow do. 
(pk)(p'k) =f Fersresic veer [(pk)x + (p’k)G — x)P 


or, eXpanding the scalar products with p = (e, p), p’ = (e, p’), 
a do, 
: =| 4 | eo= -[px+p’(i-x)]}* 


The inner integral is now easily calculated in spherical coordinates with the polar 
axis along the vector px + p’(1— x), giving 


-| An dx 
(ew) — [px +p’ - x) Pk? 


1 

-{ 4m dx 

[m? + q°’x(1— x)]k’+ €7A7 
0 


The other two integrals, with (pk)’ and (p'k)’ in the denominators, are derived from 
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this by putting q = 0. Using also the formula 


pp’=e’—p:p'=m’+3q, 


1 
= | ax J ae 
T (k?+ \*) Um? + q?x(1—x)]k?+ 67A°) om’k’ + &7Q7J' 
0 


(120.10) 


we get 


The integration over d|k| calls for the calculation of integrals having the form 


®max 


J kale 

(ak’ + A7)V (k?+ A’) 

_1 f dik} dk} 

a J (k°+A*) a _. (k? + 7) 


asl, Detoas_1 f dz 
a” a} (az?+ 1I)V(22+ 1) 
0 


In the second integral we have put Az for |k| and replaced the upper limit @nax/A by 
infinity; this is permissible, since the integral converges. 

The integrals over dx which then occur in (120.10) cannot be expressed entirely 
in terms of elementary functions. The result may be written in the form 


do = a [F (4a) log “9+ F, | doe, (120.11) 
wheret 
2 
FO==|xigep eet Vet D1], (120.12) 
2€ e + |p| 
ap °F 


! at ft Gi log a La (120.13) 


= = [m?+q°x(1—x)]. 


+ The function F(€) has already occurred in §98, Problems. This is not surprising, since (120.11) can 
be derived with logarithmic accuracy by integrating the cross-section (98.8) for the emission of 
zero-mass photons over dw from A tO wmax. 

If € is replaced by a variable 6 such that & = sinh +0, then 


F (8) = (2/7)(@ coth 6 — 1). (120.12a) 
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An asymptotic expression for the cross-section in the ultra-relativistic case can 
be obtained, assuming not only that «>m but also that |q|>™m, i.e. that the 
scattering angle is not very small. Then the important range of values of x in the 
integral in (120.13) is that for which a <1; the appropriate approximations give 


F, =~ 


1 

2 

q log a 

| a at 
0 


1 flog (q’/e*) + log x + log (1—x) 
ee f eS dx. 
2 x(1—x) 


The integral is to be cut off at a~1, ie. at x~m/7/q’ at the lower limit and 
1—x ~m/’/q’ at the upper limit. Then 


wail qo 4 1.24 
ae [2 tog 4s log-4 log | 
=5, [lo 2G 410 AiG 4] 
2a em ein ein ; 


This formula is valid as far as the squares of the logarithms (with double- 
logarithmic accuracy). To the same accuracy, it is sufficient to put in the first term 
in (120.11) 

F(é)~ (4/7) log € (€> 1). 


The final result is 


_ 2a Gis: Wiis e, q 1, »q 2 2 
do = = log m2 08 " log-log mit 4108 = doi, qg >m’. 


(120.14) 


§ 121. Electron scattering in an external field in the 
second Born approximation 


In the first two approximations with respect to the external field, the scattering 
of an electron is represented by the diagrams 


19 


! apt y 197 ‘Pp 
Yn p—>—-* (121.1) 
7 (l) : P=prd . 

M mM”) 


The first of these corresponds to the amplitude M“” ~ Ze’ considered in §80. The 
amplitude of the second approximation is M® ~ (Ze’)’. 
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It is easily seen that terms of the same order as this arise from the radiative 
corrections. In the third order of perturbation theory, the radiative corrections to 
the scattering amplitude are represented by the diagrams 


q Vq 


(121.2) 
é 
Here M® ~ Ze’: e*, and M® ~ M® (if Z ~ 1). 
According to (64.26), the scattering cross-section is 
do =|M$? + M3? + MY? do’/167’. (121.3) 


In the squared amplitude we can retain not only |M$)|’ but also the interference 
terms between M$} and M‘} and between M'$;) and M$). Thus the cross-section is 
given, as far as the terms in e°, by the sum 


do = dao" + dao + dora, (121.4) 


where do” is the cross-section in the first Born approximation (§80), and the 
corrections are 


do =2re M$) M?* do'/167’, 


Adyaa = 27 M$? M$?* do'/16 7°. (121.5) 
From §80, 


M}) = |e|(a’y°u)®(q), (121.6) 


where ®(q) is a Fourier component of the scalar potential of the constant external 
field (© = Af), and we have used the fact that the electron charge e = — |e]. 

The two expressions (121.5) can evidently be calculated independently. The first 
will now be discussed, and the second in §122. 

The second-approximation amplitude is given by the diagram (121.1) as the 
integralt 


3 
Mp=—e | {apy AT yup)fop'—pnot-p ob. (12.7 


The ““4-momenta”’ q,;=f —p and q.=p’-—f of the constant external field have no 


+ Here it is necessary to apply the diagram-technique rule concerning a constant external field; see 
rule 8 in §77. 
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time components. Hence 
fo=e=e’, (121.8) 


where e and e’ are the initial and final electron energies, which in elastic scattering 
are the same. 
In the purely Coulomb field of a stationary charge Zle|, 


@(q) = 47 Zle|/q’. 


For this potential the integral (121.7) is logarithmically divergent (when f ~ p and 
f~p’). This divergence is specific to the Coulomb field, and arises from the 
slowness with which the field decreases at large distances. Its origin is most easily 
shown for the non-relativistic case. According to QM (135.8), the coefficient of the 
spherical wave e'"’/r in the asymptotic expression for the wave function of an 
electron in a Coulomb field is 


f(6) exp (- j ay log ip| r), 


This coefficient is also the electron scattering amplitude in the field, and we see that 
its phase includes a term which diverges as r->%. When the scattering amplitude is 
expanded in powers of Za, this term causes divergence of all the terms in the 
expansion from the second term onwards (since the function f(@) is proportional to 
Za). In the relativistic case there is, of course, a similar situation. 

These arguments also show that the divergent terms must cancel when we 
calculate the scattering cross-section, in which the phase of the amplitude is 
unimportant. The simplest procedure for a correct calculation is to consider first 
the scattering in a screened Coulomb field, putting 


®(q) = 47Z|el/(q’ + 5°) (121.9) 


with a small screening constant 6<|p|. This eliminates the divergence in the 
scattering amplitude, and we can then put 6=0 in the final formula for the 
cross-section. 

Substituting (121.9) in (121.7), we have 


Mi=—2 2araply’e + mi +y Fu), 


with the notation 
n={ ee 
[(p' — f)° + 8°][(f — p)’ + 87] [p?— f? + i0]’ 


f d°f 


(121.10) 
I= | HT TE pes op Po Ot PD 
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p’ = e°—m’=p”, and the integral J is symmetrical in p and p’; from considerations 
of vector symmetry, it is immediately obvious that the vector J must be parallel to 
p+p’. Now, eliminating the matrices y by means of the equations 


ypu =(y'e—m)u, 
ii'y -p'=ii'(y’e —m), 


we obtain 
Mi) = -= Za7ai(py'e(S, + J) + m(T,— J] u(p). (121.11) 


In order to continue the calculations, we change (as in §80) from the bispinor 
amplitudes u and u’ to the three-dimensional spinors w and w’ which correspond to 
them in accordance with (23.9) and (23.11). A direct multiplication gives 


i’u=w'*{(e +m)—(e—m)cosé0+iv:o(e—m)sin 6} w, 
a'y°u = w'*{(e +m)+(e—m)cos 0—-iv: o(e —m) sin 6} w, 


where v=nXn'/sin6, n=p/|p|, n'=p’/|p'|, cos@=n-n’. Then the amplitude 
(121.11) may be writtent 


M'? = 4rw'*(A® + B? p+ o) w, 


1 
Oya 
° 2 


5,2 2 a'{[(e +m)+(e—m)cos 6] e(J,+ J,)+ 


(121.12) 
+[(e + m)—(e—m) cos 0] m(J; — Jy)}, 


B= a Z?a%(e — m) sin 6[e(J1 + Jn) — MJ, — Jb). 


The first-approximation scattering amplitude is, in corresponding notation, 


M$) = 4nrw™ (A+ BO p+ a) w, 


AO = 53 [(e +m) +(e ~_m) cos 6], (121.13) 
BO =—i Fle —m) sin 6, 


where q=p’—p. 
The scattering cross-section and the polarization effects are expressed in terms 
of the quantities A = A+ A® and B = B“+ B® by the formulae derived in QM, 


+ The definition of A and B is as in §37 and in QM, $140, and differs by a factor from that in §80. 
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$140. For example, the scattering cross-section for unpolarized electrons is 
do = (\A|’+|B]’) do’ 
= do +2(A re A? — iB” im B™) do’. 
Substitution of (121.12) and (121.13) and straightforward calculation gives 


353023 
do® = — do’ 5,25 [(1 — v? sin? 46) re (J, + Jp) + (m?Je) re (J, — Ja), 
Tp’ sin’ 30 
(121.14) 


where v= p/e is the electron velocity and @ the scattering angle. The electrons are 
polarized by scattering, and the polarization vector of the final electrons is 


, 2re(AB =) 
o = TAP+ (BE 
2(A” re B®? — iB” im A®) 
~ (AMPS IBOR 


or, substituting (121.12) and (121.13), 


4:31 1 
_ 4Zamp ee ere im (J; — Jn) v. (121.15) 


! 
G TE 


Let us now calculate the integrals J; and J,. This is more easily done by using 
the parametrization method (131.2). The integral J; then becomes 


11 ti 


Sens d°f dé, dé, dé, (1 — £:- &- &) 
Hi YT Iter + 8°], + [(p —f) + 8° & + [f° — p* — 0] E53} 


0 0 


The integration over dé; eliminates the delta function, and reduction of the 
denominator then gives 


1 1-& 
a — A fde de 
o : J J {8°(E + &) + p(2E, + 2 — 1) — 2E - (Erp! + Ep) +f — 10} 


Using in place of f a new variable k = f— é\p’— &p, we can reduce the integration 
over d°f to the form 


{ d?k ow? 
(k—a?—id) | 4a” 


so that 


| dé, d&, 
J {p’°(éi + €3— 2& — 2& + 1) + 2é:Ep- p’— 8°(E, + &) — iO" 
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Instead of €, and € we use the symmetrical combinations x = €&,+ &, y = &— &. 
The integration over dy from 0 to x is elementary, and gives 


i 


i= x dx 
~ a [bx?— 2x + 1—(87/p’)x — i0][ — x) — (8?/p?)x — id)” 


where 
b =(p°+p:p’)/2p 
= cos’ 30. 


To calculate the integral over dx as 6-0, we divide the range of integration 
into two parts: 


1 1-3, 1 


[---ax= { ange | dx, 1> 81> 8/|p|. 


0 0 i3j 
In the first integral we can put 6 = 0; then? 
Ls, 


ae 1 (1-xy |” 
| = oT ah | 8s 79x 4+1-i0 
0 


“aq - sao [low ep + in] 


In the second integral we can put x = 1 everywhere except in the term (1— x)’ and 
5 = 0 in the first bracket in the denominator. Thent 


| a oe eee dx’ 
~  1=b J fx?—(87/p?) — id} 
5; 


Tol) wer 


dx' 
[(8°/p*) — x”]"” 


= eee [tog APL + sin | 


5 


+ The term i0 arises from the rule for avoiding the singularity, which gives the change in the argument of 
the logarithm between 0 and | — 5;, namely from 0 to — 7 as we pass below the branch point. 

+ Here again the singularity avoidance rule gives the sign of the square root as we go from positive to 
negative values of the radicand. 
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On adding the two integrals, 6, disappears, as it should, leaving 


= im’ 2\p| - 4 
N= apt ant 108 ( 5 sin 16). (121.16) 


The integral J, is calculated similarly: 


os a°(1— sin 30) im? 4 
J,= J, A[p|’ cos? 46 sin 10 pF cos* Fo [08 87 28. (121.17) 


We now have only to substitute these expressions in (121.14) and (121.15), 
obtaining as the final results 


2) m(Za)e 2 gel ; 
do app ey (1 — sin 36) do’, (121.18) 
, 2Zam\p|_ sin’? 30 log sin 30 
C= Lore — — vy sin?40) cost een 


(W. A. McKinley and H. Feshbach, 1948; R. H. Dalitz, 1950). 

In the first Born approximation, the electron and positron scattering cross- 
sections are the same (in the same external field); in the second approximation, this 
symmetry does not occur. In the scattering of a positron (charge +|e|) the 
amplitude of the first approximation (121.6) has the opposite sign, but the sign of 
M$? is unchanged. The cross-section do™, which is the interference term between 
M}) and M'?, therefore changes sign. The same occurs for the expression (121.19) 
for the polarization vector. The formulae for electron scattering are all converted 
to those for positron scattering by the formal change Z > — Z. 


§ 122. Radiative corrections to electron scattering in an 
external field 


Let us now calculate the radiative corrections to electron scattering in an 
external field (J. Schwinger, 1949). The corresponding part of the scattering 
amplitude is represented by the two diagrams (121.2). The contribution from the 
first of these to the amplitude is 


P(—q’) 


An 


—(ii' y°u) D(— q’)- e®(q), 


where Y(-—q’) is the polarization operator corresponding to the loop in the 
diagram. The contribution from the second diagram is 


— (ii'A°u)e®(q), 


where A° is the correction term in the vertex operator ([™“ = y" + A“); according to 
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(116.6), 

= V'If(—@)- -= 0" aug(— @). 
Adding the two contributions, we havet 


Mf = — (it! y’ Qua —— 


(122.1) 
Qra(q) = f(—q?)- 1--5 pa Cts 7 8 Ga" Y 

Let us first consider the infra-red divergence in the form factor f(—4q’) and 
therefore in the scattering amplitude (122.1). It has already been mentioned in §98 
that the exact value of the purely elastic scattering amplitude is zero, i.e. it has no 
meaning. The only physically significant thing is the amplitude of scattering defined 
as a process in which any number of soft photons can be emitted, each having an 
energy less than a specified value w,,, satisfying the conditions for soft-photon 

emission theory to be valid. That is, only the sum 


®max Mmax Omax 


da = do.,+ dog | dw,, + doe: iM | dw,,, | GW: (122.2) 


0 0 0 


is meaningful, where do,, is the cross-section for scattering without emission of 
photons, and dw,, the differential probability for the emission by the electron of a 
photon with frequency w. Here it is assumed that do, itself is calculated as a 
perturbation-theory series, i.e. as an expansion in powers of a.t Then, on bringing 
together the terms of each order in a in (122.2), we obtain do as an expansion in 
powers of a in which each term is finite. 

In the first Born approximation, do.~ a’. This term has, of course, an in- 
dependent significance. If, however, the next correction (~a’) to do. is to be 
taken into account, we must also include the second term in the sum (122.2): since 
dw, ~ a, multiplication by do. ~ a’ likewise gives a quantity ~a°*. We shall show 
that the infra-red divergence disappears when these two quantities are added. 

The divergent term in the form factor f (117.17) is§ 


—3aF (\q|/2m) log (m/A). 


The corresponding term in the amplitude (122.1) is 


1aF log (m/A)(a'y°u) e®(q), 


+ Note that q, = (0,—q) if gq =(0,q), and therefore o°" q, = — y’q- y. 

+t The need to take account of radiative corrections in the ArobAbility dw. 1s governed by the value 
of wmax; the limit w > 0 corresponds to the classical case where the radiative corrections are zero, and so 
the latter can always be made small by taking a sufficiently small @max. 

§ This expression 1s easily verified by using the relation 


lqlim =(1-/VE 


between |q| and the variable é in terms of which (117.17) is written. 


QE4 = JJ 
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and in the cross-section (121.5) 
doi“ = — aF log (m/A)|a'y°u||e®(q)/’ do'/1677’. 
Comparing this with the Born cross-section 
do = |ia'y°ul|e®(q)|? do'/167’, 
we find that 


doi = — wF log (m/A) do. (122.3) 
The second term in (122.2), with f dw, from (120.11), gives 


W max 


doe | dw,, = aF log (2@max/A) dao. (122.4) 


0 
Finally, adding (122.3) and (122.4), we obtain 
— do aF (\q|/2m) log (m/2@max)- (122.5) 


We see that the divergent contribution from soft (\k| ~ A) virtual photons does 
in fact cancel with that from the emission of real photons of the same kind. A 
similar result occurs in any other scattering process. 

There is also a dependence of the scattering cross-section On @max, resulting 
from the fact that w,,., appears in the definition of scattering as a process in which 
any number of soft photons can be emitted. The cross-section for such a process 
will of course decrease with the upper frequency limit ®,,, for photons whose 
emission we regard as belonging to the scattering process in question. 

Let us now determine the complete radiative correction to the scattering 
cross-section. Proceeding in accordance with the standard rules (see (65.7)), we find 
as the cross-section averaged over the polarizations of the initial electron and 
summed over the polarizations of the final electron 


do = da ae Arad 
= |eB(q))’ tr{(yp’+ my" + y’Qraa(yP + My" + y’Qraa)} do'/32m*. (122.6) 


According to (122.1), 


Qa=atby-q, Qra= y’Qiay® =a-by-q, 


a=f(—q)-1- 2 A(-4@), b=5,,8(-4)). 


As far as the terms linear in a and b, the trace in (122.6) is given by 


4tr{- + -} = 2(e?—4q’)(1 + 2a) — 2bma’. 
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Hence 


1 2 
AC rad = 24 fu aq l= a PAG) = ae - g(- ¥)| do", (122.7) 


where do” is the Born cross-section (80.5) for the scattering of unpolarized 
electrons, and a subscript A is added to the form factor f in order to show explicitly 
that it is cut off at photon mass X. 


We now have only to add to (122.7) the cross-section for the emission of soft 
photons. If we write f, in the form 


fi(—q’) = 1-3aF (\q|/2m) log (m/A) + @Fo, (122.8) 
then from (120.11) this addition simply means replacing f, in (122.7) by 
fogs = 1—30F ({q|/2m) log (m|2emax) + 20F\ + oF. (122.9) 


With this change, (122.7) gives the final answer. 
In the non-relativistic limit we havet 


2 
—_ 1-329, (logs +54), g? <m?. (122.10) 


The particular form of the external field appears in the radiative correction to 
the cross-section only through do”; the factor in the braces in (122.7) is universal. 
In the non-relativistic approximation, 


2 
dona=— do” = (log 5™ +), g? <m?, (122.11) 


which includes contributions from all the terms in (122.7). In the opposite (ultra- 
relativistic) limit, the main contribution comes only from the term in f,,.— 1: 


gq? > m?. (122.12) 


max 


Finally, it may be noted that the radiative corrections considered here do not 
cause any additional polarization effects that are not present in the first Born 
approximation (unlike the corrections of the second Born approximation, discussed 
in §121). The reason is that the particular features of the first Born approximation 
are ultimately due to the fact that the S-matrix is Hermitian. This property is 
maintained even when the radiative corrections described above are taken into 


+ This differs from the non-relativistic formula (117.20) by the change 


log rN ard log 2Wmax a - 
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account, since in this approximation there are no real intermediate states in the 
scattering channel (and so the right-hand side of the unitarity relation is zero). 


§ 123. Radiative shift of atomic levels 


The radiative corrections cause a shift of the energy levels of bound states of an 
electron in an external field, called the Lamb shift. The most interesting case of this 
kind is that of a hydrogen atom (or hydrogen-like ion). 

A consistent method of finding the energy-level corrections is based on the use 
of the exact electron propagator in an external field ($109). But, if 


Za <1, (123.1) 


it is possible to use a simpler procedure in which the external field is regarded as a 
perturbation. 

In the first approximation with respect to the external field, the radiative 
correction in the interaction between an electron and a constant electric field is 
described by the two diagrams (121.2) already used in connection with the problem 
of electron scattering in such a field; the change from one problem to the other 
needs no more than a simple reformulation (see below). 

However, it is easily seen that this treatment can give only the part of the level 
shift that is due to the interaction with virtual photons of sufficiently high 
frequency. Let us consider, for example, the next radiative correction (as regards 
order with respect to the external field) to the electron scattering amplitude: 


(123.2) 


(unlike (121.2b), this diagram contains two external-field vertices). In the range of 
integration over d*k where ky is sufficiently large, this correction involves an extra 
power of Za, and is therefore unimportant. But the addition of a second external- 
field vertex to the diagram also brings in a further electron propagator G(f). When 


+ The calculation of the radiative corrections for processes which appear only in the second 
approximation of perturbation theory is considerably more laborious, and will not be given here. We shall 
simply list some references: L. M. Brown and R. P. Feynman, Physical Review 85, 231, 1952 (radiative 
corrections to photon scattering by an electron); I. Harris and L. M. Brown, ibid, 105, 1656, 1957 (r.c. toa 
two-photon pair-annihilation); M. L. G. Redhead, Proceedings of the Royal Society A220, 219, 1953, and R. 
V. Polovin, Soviet Physics JETP 4, 385, 1957 (r.c. to electron scattering by an electron or a positron); P. I. 
Fomin, ibid. 8, 491, 1959 (r.c. to bremsstrahlung). 

¢ The shift of the hydrogen levels was first calculated by H. A. Bethe (1947) with logarithmic accuracy, 
using a non-relativistic treatment; this work provided the initial stimulus for the whole subsequent 
development of quantum electrodynamics. The difference between the 2512 and 2p levels (in the first 
non-vanishing approximation of perturbation theory) was exactly calculated by N. M. Kroll and W. E. 
Lamb (1949); the complete formula for the level shift is due to V. F. Weisskopf and J. B. French (1949). 
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k is small, and the free ends p and p’ are non-relativistic, the important values of 
the virtual-electron momenta f are those close to the pole of the propagator G(f). 
The small denominator which thus occurs cancels the extra small factor Za. The 
same evidently applies to the corrections of all orders with respect to the external 
field. Thus, at low frequencies of the virtual photons, the external field must be 
taken into account exactly. 

We can divide the required level shiftt 5E, into two parts: 


SE, = SE + 5E%, (123.3) 


which originate from the interaction with virtual photons having frequencies in the 
ranges (I) ky) > « and (II) ky) << «; k is chosen so that 


(Za)’m <x <m, (123.4) 


where Z’a’m is of the same order as the binding energy of the electron in the 
atom. Then, in region I, it is sufficient to take account of the nuclear field in the first 
approximation. In region II, the nuclear field must be treated exactly, but on the 
other hand, since x <m, we can solve the problem in the non-relativistic ap- 
proximation—not only as regards the electron itself, but for all the intermediate 
states. With the condition (123.4), the ranges of validity of the two methods of 
calculation overlap, and it is therefore possible to make an exact “joining” of the 
two parts of the level correction. 


THE HIGH-FREQUENCY PART OF THE SHIFT 


Let us first consider region I. Here it is possible to use the correction (122.1) to 
the scattering amplitude, after removing the contribution of the virtual photons 
which pertain to region IJ. These make only a small contribution to the form factor 
g, which therefore can be left unaltered. The low-frequency virtual photons make a 
large contribution to f, however, because of the infra-red divergence. Thus f in 
(122.1) must be taken as a function f, from which the region ky<« has been 
excluded. 

This could be done directly by subtracting from f the integral over the region 
ky <x, but the required result can be obtained without fresh calculations by using 
the results of §122. To do so, we note that the exclusion of frequencies ky < « can 
be regarded as one possible type of infra-red cut-off. The result for the correction 
to the scattering cross-section must, of course, be independent of the cut-off used, 
provided that the real soft photon emission probability is cut off in the same way, 
i.e. the concept of “elastic”? scattering includes the emission only of photons with 
frequencies from x to the specified wma. If we take wmax =k, there is no need to 
take explicit account of the photon emission. Hence we see that f, is obtained from 
the f,.. determined in §122 by simply replacing wma by «. In particular, in the 


+ In this section, E, denotes the energy of an electron in an atom, not including its rest energy. The 
suffix s stands for all the quantum numbers which define the state of the atom. 
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non-relativistic case 


ee +54) 
fo-1=- 24, 3 (log 5 5g (123.5) 


Let us now transform the correction (122.1) to the scattering amplitude by 
representing it as the result of a corresponding correction to the effective potential 
energy of the electron in the field. Comparing the amplitude (122.1) 


se e(u'* QaaPu) 
with the Born scattering amplitude (121.6) 
—e(u'*®u), 


we see that the correction is given (in the momentum representation) by the 
function 


ed D(q) = CQraa(q)P(q). (123.6) 


In the non-relativistic case, taking P and g from (113.14) and (117.20), and 
substituting f, from (123.5) for f, we get 


2 
_f__eq (,.,m 4-4) a, 
5(q) = | - 524 (log +55-5) + gon a yfo. (123.7) 


The corresponding function 6@(r) in the coordinate representation ist 


114 
8@(r) = =" “(log x ga +575) MOO) ig 


vy: V(r). (123.8) 


The level shift 5E® is found by averaging e5®(r) over the wave function of the 
unperturbed state of the electron in the atom, Le. as the corresponding diagonal 
matrix element:+ 


(ss €a il 5) = 
oP: Som (Io 085, +94 ~5)X8140ls) 


ea 
ig (s|y - VOls). (123.9) 


In the first term, the non-relativistic electron function suffices for the averaging. 


+ Note that this correction to the potential is not the same as the one discussed in §114, which 
included only the effect of the vacuum polarization (diagram (121.2a)) on the Coulomb field as such. The 
correction (123.8) relates to the interaction of the field with the electron, and includes also the effect of a 
change in the motion of the electron (diagram (121.2b)). 

+ Strictly speaking, the form factors determined in §117 related to the vertex operator with two 
external electron lines (p*= p” =m’). For an electron in an atom, the energy E; is a level which is 
unrelated to p. The distinction may, however, be neglected in region I. 
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In the second term, this approximation is insufficient: the zero-order approximation 
with respect to the non-relativistic functions is zero on account of the absence of 
diagonal elements in the matrices y. Here, therefore, we must use the approximate 
relativistic function 


derived in §33, retaining the components y which are small (in the standard 
representation). We have 


w*yb = b* ax —x* oh 


and, substituting from (33.4) 


we get, using the identity (33.5) and integrating by parts, 
(sly -V@ls)=—5 | {6 *(o  VO\(o V4) + (VO* oo -VO)4} dx 
~ 5 | {b*A®- b —2io - h*[V® X VAb]} dex. 


Since ® = MD(r), 


_rd® 
ar Te 
and hence 
_1d® A 
—io [V@xV]=- Fad i. 
where 1=—ir XV is the orbital angular momentum operator. Finally, bringing 


together the expressions obtained and substituting in (123.9), we have 


1d® 
r dr 


(I) e? m 19 e° 
6b. =z (log so +35 (s|A®|s) +7 Sia: l S ), (123.10) 


in which the averaging is over the non-relativistic wave function in both terms. 


THE LOW-FREQUENCY PART OF THE SHIFT 


In order to calculate the second part of the level shift, we use a technique based 
ultimately on the unitarity condition. 
Since a photon can be emitted, the excited state of the atom is not strictly 
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stationary, but only quasi-stationary. A complex energy value can be assigned to 
such a State, its imaginary part being —3w if w is the decay probability of the state, 
in this case the total photon emission probability (see QM, $134). In the non- 
relativistic approximation there is dipole radiation, and from (45.7) 


im dE, aaa t eae sw, = — 4 Ss ld.s\°CE, —= E,)’, 


where the summation is over all the lower levels (E, < E,), or, equivalently, 
im SE, = —3 i deo - 3. |dw CE, ~ E,)°8(E, — Ey — 0). (123.11) 
0 Ss 


In order to find the real part of 6E,, we must regard E, as a complex variable 
and use analytical continuation. This may be done by treating the delta functions as 
originating from poles. The rule for the avoidance of poles is, as usual, specified by 
adding a negative imaginary part to the masses of the virtual particles—in this case, 
to the masses m, of the electron in the intermediate states of the atom. These are 
m, =m+tE,, and so we must put 


By — Ey: = i0, 
whence 


l 


ite es oe ee 
Oe Es Oe ee 0" 


(123.12) 


cf. (111.3). 
Thus, substituting (123.12) in (123.11), we find 


(E, Zz. Ey 


: ea ee Ce ene) 
im 6E, =im | de 2, lds" E.-E.cw ti0 
0 


The required analytical continuation is now obtained by simply omitting the symbol 
im, but we must take from 6E, only the part due to the contribution from 
frequencies in region II (w <«). To do so, we need only replace the upper limit of 
integration by «. The result of the integration is 


K 
eg ae (123.13) 


2 
§sE™) == a= > \d.<°(E,y _ E,) log 


because of the inequality (123.4), the difference E, — E, is neglected in comparison 
with « at the upper limit. We shall henceforward be concerned only with the real 
part of the level, which is obtained by using «/|E,—E,| as the argument of the 
logarithm in (123.13). 
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In the expression (123.13), the term in iog k can be transformed by replacing the 
matrix elements of the dipole moment d=er by those of the momentum p= mv 
and its derivative p: 


2 
é 
> \d../(Es = E,)° a m. > pss CEs = F,) 
ie’ 
a Im? > {(p)ss° * Ps’s — Pss’ * (p).'s}. 


Now replacing p in accordance with the operator equation of motion of 
the electron, p= — eV®, we get 


3 
e 
> ld,,PCEs: a E,)° <=. at >» {(V®),5 * Ps's — Dss’ * (V®),:5} 
ie? |. 
= 52 (5|P -V—-V9 - pls) 
e? 
= 5773 (s|A®|s). (123.14) 
We can therefore write instead of (123.13) 
SEY = sem (s|A®|s) log ate + 
>  - 3am m 
ey |r (Ey — E y ieee (123.15) 
377 < = . : 2\E, = E,|° 
THE TOTAL SHIFT 
Finally, adding the two parts, we have the following formula for the level shift: 


_ 2e 2 3 ise 
bE, = 30 2 Irss'|CEs — Es) log 2|E; — E,.| . 


e> 19 _ @ [d®| \. 
+ 4-75 (81AGls) = 79 (5 Cote s): (123.16) 


as was to be expected, the auxiliary quantity k does not appear.t 

All the matrix elements in (123.16) are taken with respect to the non-relativistic 
wave functions of the electron in the atom. For a hydrogen atom or a hydrogen-like 
ion, these functions depend only on three quantum numbers: the principal quantum 
number n, the orbital angular momentum | and its component m, but not on the 


+ The determination of the next-order corrections in the level shift involves very complicated 
calculations. The most complete tabulation and a systematic derivation of the corrections, together with 
further references, are given by G. W. Erickson and D. R. Yennie, Annals of Physics 35, 271, 447, 1965. 
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total angular momentum Jj; the corresponding energy levels depend on n only. We 
shall use the notationt 


3 2\2 
= n Ut anyt 2 — 3 m(Ze ) 
Ln = SaeZeRe Di Kn'l'm'|r|nim) (Ey — En)? * log EE 123.17 


The energy levels are proportional to (Ze’)’, and the characteristic dimension of the 
atom is proportional to Ze’, so that the L,, defined by (123.17) are independent of 
Z. They can be calculated numerically. 

We shall take separately the cases 1! =0 and 14% 0. When | = 0, the last term in 
(123.16) is zero. In the second term, we use the equation 


eA® =47Ze’S(r), 
which is satisfied by the potential of the Coulomb field of the nucleus. Hence 


(nlm|A®|nlm) = 42Ze?|Wnim(0)|? 
7 ‘ii (1 = 0) 
0 (1 # 0) 
(cf. (34.3)). In the first term, with the notation (123.17) and again using (123.14), 


> {(n'l'm'|r|n00)|(Ey — En)? = a (n00|A@|n00) 
n'l'm' 


= 2m(Ze’)*/n?. 


This gives the following expression for the shift of the s terms (in ordinary units): 


_ 4mc’?Z*a° 1 19 
SEno = 37 log Fay t bw + 35 | (123.18) 
The numerical values of some of the Lino are: 
n 1 #4 3 4 00 


Lio: “SH 2984 = 2312. =—2,768. — 2750" —292) 


The unperturbed levels are E, = — mc?(Za)’/2n’, and so the relative magnitude of 
the radiative shift is 


|SEno/Eno| ~ Z’a° log (1/Za). (123.19) 


When | ¥ 0, the second term in (123.16) is zero. The third term can be calculated 


+ The matrix elements of r are diagonal in j and independent of j; the summation over s in (123.16) 
therefore reduces to summation over n, | and m. Because of the isotropy of space, the sum (123.17) is also, of 
course, independent of m. 
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by means of the formulae in §34, and leads to a dependence of the level shift on the 
number j also. The result is 


— 4mc?Z*a° 3jG+1)-114+ 1)-3 
Oba = 3an? nl + orep | lx (0. (123.20) 


Thus the radiative shift removes the last degeneracy which remains after the 
spin-orbit interaction has been taken into account, namely the degeneracy of leveis 
having the same n and j but different | = j +4. For example, the numerical value of 
L»; is + 0.030, and formulae (123.18)—-(123.20) give as the difference of the 25). and 
2Pi2 levels of the hydrogen atom 


= 25 
Fo 1/2) — Eni) = 0.41mc*a’, 


corresponding to a frequency of 1050 MHz. 


§ 124. Radiative shift of mesic-atom levels 


At the end of $118 the electron vacuum polarization has been shown to play an 
important role in the (second-approximation) radiative correction to the magnetic 
moment of the muon. This still more true (and even in the first approximation) as 
regards the radiative level shift in w-mesic hydrogen, a hydrogen-like system 
consisting of a proton and a muon (A. D. Galanin and I. Ya. Pomeranchuk, 1952). 

In calculating the level shift for an ordinary atom in §123, we took account, in 
particular, of the electron vacuum polarization effect (the electron loop in the 
diagram (121.2a)). If the muon vacuum polarization effect is similarly treated in the 
mesic atom, the entire calculation can be applied to this case, simply replacing the 
electron mass m = m, by the muon mass m,. Since the relative shift (123.19) of the 
levels does not depend on the electron mass, the same result is obtained for mesic 
hydrogen. 

It is easily seen that the electron vacuum polarization has a much stronger 
effect on the level shift in the mesic atom, because the replacement of the muon 
loop in the diagram by an electron loop implies the replacement of the muon 
polarization operator by the electron polarization operator; and the polarization 
operator P(q’) is inversely proportional to the square of the particle mass for 
non-relativistic values of q*. Hence the change mentioned must increase the effect 
by a factor (m,/m,)’, and it is this contribution which determines the order of 
magnitude of the level shift: 


SE/|E| ~ a3(m,/m.), 


or four orders of magnitude greater than in ordinary hydrogen.t The origin of this 
effect can be more clearly seen by noting that the distortion of the Coulomb 


+ For a similar reason, the contribution of the muon vacuum polarization to the level shift in the 
ordinary hydrogen atom is, conversely, negligible. 
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potential by the electron vacuum polarization extends to distances ~ 1/m, (8114). 
In the ordinary hydrogen atom the electron is at distances from the nucleus that are 
of the order of 1/m.a, i.e. outside the main region of field distortion, but in mesic 
hydrogen the muon is at distances ~ 1/m,a which are in this region. 

To calculate precisely the level shift in the mesic atom, however, it is not 
possible to use the approximate non-relativistic expression for the polarization 
operator, aS was done when using (123.7) to find the level shift in the ordinary 
atom. The reason is that the characteristic momenta of the muon in the mesic 
hydrogen atom are |p,|~ am,. For the muon these momenta are non-relativistic, 
but for the electron they are relativistic. 

We must therefore use the full relativistic formula (114.5) for the effective 
potential of the nuclear field as modified by the electron vacuum polarization. The 
level shift is found by averaging over the wave function of the muon in the atom: 


$= el | I l?8@(r) dx 


=~le| { Ri(r)o@(ryr? dr, (124.1) 


where R,, is the radial part of the (non-relativistic) Coulomb wave function. For a 
hydrogen-like ion with nuclear charge Zle|, the functions R,,(r) depend on r only 
through the dimensionless combination p = Zam,r (the distance in Coulomb units). 
Using this fact and substituting 6®(r) from (114.5) (with the charge Zle| in place of 
e,), we can bring the integral (124.1) to the form 


sEg== = Zam, Qu(m./Zam,), (124.2) 


where 


onts)= fod f Rite (10g) MED a 


The first few levels of mesic hydrogen are shown by numerical evaluation to have 
the following relative shifts: 
BE jol| E40] = —6.4x 10%, 
SE 9/| E 9] = 2.8X 10%, 
6Ey/|E>)| ar? ae 2.0X 10°, 
§ 125. The relativistic equation for bound states 
The method used in the preceding sections to calculate the radiative shift of the 


atomic levels is not valid for solving a problem such as that of determining the 
corrections to the levels of positronium, a system consisting of two particles of 
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equal rank, neither of which can be regarded as the source of the external field 
acting on the other. 

The systematic procedure for solving this problem is based on the fact that the 
energy levels of the bound states are poles of the exact amplitude of mutual 
scattering of these two particles, as a function of their total energy in the 
centre-of-mass system. In any of its discrete states, positronium may be regarded 
as an “intermediate particle’? having a definite mass, which can be formed as a 
stage in the electron—positron scattering process, and a pole of the scattering 
amplitude corresponds to each “one-particle” intermediate state; these poles of 
course lie in the non-physical region of 4-momenta of the particles undergoing 
scattering. 

According to (106.17), the exact scattering amplitude comprises the exact 
four-ended vertex part I’,i. and the polarization amplitudes u of the particles. The 
latter are clearly unconnected with the pole singularities, and it is therefore more 
convenient to ignore them, referring instead to the poles of the vertex part itself, 
i.e. of the function 


tin. pas ps: — p+), (125.1) 


where the notation for the 4-momenta of the external lines of the diagram (106.12) 
corresponds to the scattering of a positron by an electron. 

It should be stressed that the assertion that poles are present refers to the exact 
scattering amplitude or the exact vertex part; there is no pole in any separate term 
of the perturbation-theory series, as can be seen from the fact that the Feynman 
diagrams in each approximation include only electron (and photon) lines, not lines 
belonging to the composite particle positronium as a whole. Hence it follows in 
turn that the calculation of the scattering amplitude near its poles involves 
summation of an infinite series of diagrams. The diagrams concerned can be 
determined as follows. 

In the first non-vanishing approximation of perturbation theory (the first ap- 
proximation with respect to a), the vertex part (125.1) corresponds to two second- 
order diagrams: 


pe p. -P, P- 
ne. ol a all (125.2) 


] P-PL (PtP, 
oe ae p pt 
a b 
or, in analytical form, 
L cin re e yi yinmD,(p- = p-) ve ery yiiDuy(p- + p+). (125;3) 


In the next approximation (the second with respect to a) there are ten 
fourth-order diagrams: 
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(125.4) 


and a further five diagrams obtained from (125.4) by interchanging p_ and — p:. All 
these include an extra power of e*>=a in comparison with (125.2), but we shall 
show that in diagram (a) this extra order of smallness is cancelled by a denominator 
which is also small when the electron and positron momenta are small. 

All quantities will be taken in the centre-of-mass system, but, since the 
4-momenta of the external lines in the diagrams are not assumed to be physical (i.e. 
p’? xm’), «. # e- in this system, although p, = — p_. Thus these 4-momenta are 


p-=(e-,p), ps = (€1,—p), 
p.=(e!,p’) pi =(e.,-p’), (125.5) 


ben See. 
The binding energy of the electron and the positron in positronium is ~ ma’. Thus, 
in the neighbourhood of the scattering-amplitude poles, with which we are con- 


cerned, 


Ip| ~ |p'| ~ ma <m, 


le_—m|~|e,—m|~ p’/m~ ma’,... (125.6) 


The contribution to the vertex part from the diagram (125.4a) is 


Pn = — je’ | (y‘G(q)y" )aly’G(q 2D = Da) ie OX 
x D,,(q — p!)D,..(p-— q) d*q/27)’. (125.7) 


The important range of values of q" = (qo, q) in the integral (125.7) is that which is 
close to poles of both functions G simultaneously. In this range, |q| and [qo — m| are 


PA 
can 
tA 
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small, and the electron propagators are 
eeee eee eee 
do— m —(q’/2m) + i0’ 


l 
qo — €-—€+ +m +(q’/2m) — i0 


0 
— do-y:qtrm 1,0 
G(q) =e 4 = Hy +1) 
+m —m)—q' +i0 
(qo )(o q (125.8) 


G(q — p-— ps1) ~x(y"— 1) 


The poles of these two expressions are on opposite sides of the real axis of the 
complex variable qo; closing the path of integration along this axis by a contour in the 
upper half-plane (say), we can calculate the integral over dq from the residue at the 
corresponding pole.t The result is 


peo ef —— ae od 
(q —p!)(p-— q)(2m — e- — e+ + q’/m)’ 


and so, using (125.6), we have in order of magnitude 


3 
po” shay ae (ma) _ I 
(ma) ma? ma’ 


The contribution to I from the second-order diagram (125.2a) (the first term in 
(125.3)) is of the same order, and this proves the statement made above about the 
order of smallness of the diagram (125.4a). A similar situation occurs in all higher 
approximations of perturbation theory. 

Thus the calculation of the relevant vertex part near its poles calls for the 
summation of an infinite succession of ‘‘anomalously large’? diagrams with inter- 
mediate states resembling the internal lines of (125.4a). A typical property of these 
diagrams is that they can be cut between the ends p_, — p; and p!,—p: into parts 
joined only by two electron lines.¢ The set of all diagrams which do not satisfy this 
condition will be called a ““compact”’ vertex part and denoted by Le. since it does 
not include the anomalously large diagrams, such quantities can be calculated by 
ordinary perturbation theory. For example, in the first approximation I is given by 
the two second-order diagrams (125.2), and in the second approximation by the 
eight fourth-order diagrams in (125.4), excluding diagrams (a) and (b). 

If the non-compact vertex parts are classified according to the number of 
“double bonds’’, we can represent the total I’ as an infinite series: 


"aC: J+ cel cece om 


where the continuous thick internal lines are exact propagators @Y; this is often 


+ For the diagram (125.4c), which differs from (125.4a) only as regards the relative direction of the 
electron lines, both poles would be on the same side of the real axis, and so the integral would be zero in 
the approximation considered. 

t This definition includes all the anomalously large diagrams and also some ‘“‘normal’’ diagrams such 
as (125.4b). 
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called a “‘ladder’’ series. To sum the series, we ‘‘multiply”’ it on the left by a further 


Comparison with the original series (125.9) shows that 


Pp q p— p-. p— Pp. p_ 
OK KL ts 
P, OPeP Py —Pp; a =p; —P, 


This graphical equation corresponds to the integral equation 
iT ik.im(P’,— D+3 P-»— D4) = iP im(P ! — Pss Ps — BS) + 
= | Vinsm(P!, @ — P4— p's G.— P1)Ga(G) Gurl — p4 — p') x 
x Tiin(Q, — P+ P-» 4 — p+ — p*) d*qi(2rr)J*. (125.11) 


The functions I and are calculated by perturbation theory, and equation (125.11) 
then allows, in principle, the determination of I with any desired accuracy. 

To find the energy levels, we need to know only the positions of the poles of I. 
Near the poles, [>I', and so the first term on the right of (125.11) (the second 
diagram from the right in (125.10)) may be neglected, the equation then becoming 
homogeneous in I’. The variables p,, p- and the indices k and | become parameters, 
the dependence on which is arbitrary and is not defined by the equation itself. 
Omitting these parameters and also the primes in the remaining variables p:, p', we 
have 


Tim P-s— De) = | Pron P- q — P+ — D-3 G, — P+) Gse(q) X 


X Gq — p+— PV in(q3 q-—P+—p-)d*ql(2m)* (125.12) 


(E. E. Salpeter and H. A. Bethe, 1951). 

Equation (125.12), written in the centre-of-mass system (p;+p-=0), has solu- 
tions only for certain values of ¢,+e-_, and these give the positronium energy 
levels. The function I, plays only an auxiliary role. Another function is more 
convenient in practice: 


Xsr(P 1, P2) a Gs (D OV in (P13 D2) Gnr(P2). (125.13) 


+ That is, we multiply each term in the series by I and two %, and integrate appropriately over the 
4-momenta of the new internal bonds. 
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Then equation (125.12) becomes 
iLG"'(p_-)x(p-,— D1) G"(— Dim 


= | Vinsm(P-» 4 — P+ — P-3 Gs — P)Xsr(Q, Q — Pp» —p-) d*q/(2m)*, (125.14) 


where I appears as the kernel of an integral operator. As already mentioned, f° may 
be calculated by perturbation theory, and the same is of course true of G”’. 

We shall show that, in the first approximation of perturbation theory (with 
respect to a), (125.14) reduces, as we should expect, to the non-relativistic 
Schrodinger’s equation for positronium. 

In the first non-relativistic approximation, [ is determined by the diagram 
(125.2a) alone; the annihilation-type diagram (125.2b) is zero in this ap- 
proximation.{t For a similar reason to that in §83, it is convenient to take the photon 
propagator in the Coulomb gauge (76.12), (76.13), and only Doo need be retained in 
it. Then 


Tinsm(D-, 4 — P+— P-3 4, — P+) = — ©'V Vim Doolq — p-) 
ae U(q = P_)VicVrms 
where 
U(q) = — 47e’/q’ 


is the Fourier component of the potential energy of the Coulomb interaction 
between the positron and the electron. Equation (125.14) becomes 


IVim(p-, — P+) 
=| Gordy" | Ua@-pixaa~p.— PIG VG po], (125.15 


im 


where we have also replaced the exact propagators @ by the free-electron pro- 
pagators G. The latter are given by the approximate expression (cf. (125.8)) 


G(p_)=21+ y)g(p-), G(- p1) ~2(1— y")g(p4), 
where the matrix factors have been separated, and g(p) is the scalar function 


1 


aD) = Gye 10) (125.16) 


In substituting these expressions in (125.15), we note that all the non-zero matrix 


+ The particle velocities in positronium are such that v/c ~ a. In this sense the expansions in powers 
of a and of 1/c are interrelated. 


QE4 - KK 
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elements 
Bd + y)y°vy? 30 — yim = BCy? + Dy - 307° - Dim 


are equal to the elements — Yim. The matrix equation (125.15) is therefore equivalent 
to one for the scalar function 


ix(p-, — p+) = — 8(p-)g(P+) | U(q-p_)x(q, q— p+— p-) d*qi2a)’. (125.17) 
We now replace p, and p- by the variables 


p =(e,p)=2(p-— ps), P =p_t+ps; 


these are the 4-momentum of the relative motion of the particles and that of the 
positronium as a whole. In the centre-of-mass system, P =(E+2m,0), where 
E+2m is the total energy, and E therefore the energy level relative to the rest 
mass. In terms of these variables, (125.17) becomes 


iv(p, P) =— g(p +3P)g(—p +4P) | U(q—p-_)x(q —3P, P) d*q/(2a)* 
=~ g(p +3P)g(—p +4P) | U(q'—p)x(q', P) d*q'I(2m)*. 


In this equation, P occurs only as a parameter, and y figures on the right-hand side 
only as the integral 


[e,2) 


wa) = | x(a P) dao 


—& 


Integrating both sides of the equation over de, we get an equation for in a closed 
form: 
(p) = _- | g(p +3P)g(—p +P) de | va-ni@es 
27 (27r)” 


where 


1 
oie 


If the path of integration over de is closed by a contour in the upper half-plane 
(say) of the complex variable c, we can evaluate the integral from the residue at the 
corresponding pole, obtaining 


P- d*q 
(P-E)np+ | U@-ona 5-4=0. (125.18) 
This is Schrédinger’s equation for positronium in the momentum representation; 
see QM, (130.4). 
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If only the diagrams (125.2) were used in I’, but including in them (and in G) the 
next terms in the expansion in powers of 1/c, we should arrive at Breit’s equation 
(§83). The inclusion of the diagrams (125.4) (together with the further terms in the 
expansion in 1/c) gives the radiative corrections to the positronium levels, but the 
calculations become very complicated. 

The following is the difference between the ground levels of ortho- and 
para-positronium, including the above-mentioned corrections:T 


E(?S1)— EC'S») = a 


, me‘ (2 (‘2 
9 


ule eee ed amen Ca 
mat [g— (+ 1og2) = -tia (125.19) 


the first term in the braces is the fine splitting (see §84, Problem 2). The second term 
is the radiative correction to the difference between the levels. The imaginary part 
of the difference arises from the parapositronium annihilation probability (see 
(89.4)), i.e. from the fact that the level ‘So is complex; for parapositronium, the 
level width is found to be of the same order as the radiative correction to the real 
part of the level. 


§ 126. The double dispersion relation 


After the vertex part with three external lines, the next in order of complexity is 
a section with four external lines. In quantum electrodynamics, three basic 
diagrams of this type are possible: 


\ of x. 7 

im | (126.1) 
7 \ Z _ 
a * c b Cc 


The first describes the scattering of a photon by a photon; the others are individual 
terms in the radiative corrections to the scattering of (b) a photon and (c) an 
electron, by an electron. 

This §126 deals with some general properties of such diagrams, but to be simple 
and specific we shall refer only to (126. 1a). 

The momenta of the lines in such a diagram will be denoted as follows: 


k=ktkyk, k, 
X\ 7 
ey, 
sail i, (126.2) 
- q-k;k, 
ke NG 


+ R. Karplus and A. Klein, Physical Review 87, 848, 1952. 
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The 4-momenta k,, k., k3, ks correspond to real photons, and their squares are 
therefore zero. 

If the dependence on the photon polarizations is written separately, the am- 
plitude M,,; which corresponds to the diagram (126.2) can be expressed in terms of 
various scalar functions of the photon 4-momenta. These are the invariant am- 
plitudes discussed in §70; they will be derived in §127 for the specific case of 
photon—photon scattering. Being scalars, they depend only on scalar variables, 
which may be taken as, for example, any two of the quantities 


s=(k, +k), t=(k,-k)’, u=(ki-ky’, s+tt+u=0; (126.3) 


in what follows we shall take s and t as independent variables. 
Each of the invariant amplitudes, which will be denoted here by the same letter 
M, can be written as an integral: 


M - | iB d*q 
[q°— m*}I(q — ks)’ — m*][(q — ky — kay’ — mq — kx)’ — m*V) (426.4) 


m?>—>m?’— i0, 


where B is some function of all the 4-momenta; the factors in the denominator 
arise from the propagators of the four virtual electrons. 

When s and ¢t are sufficiently small, the amplitudes M are real (more precisely, 
they can be made real by a suitable choice of the phase factor), since if s is small 
the photons cannot generate real particles (an electron—positron pair) in the s 
channel, and if t is small the same applies to the t channel.t Thus neither channel 
has real intermediate states which could, according to the unitarity condition, lead 
to an imaginary part of the amplitude. 

Now let s increase while t remains at a fixed small value. When s = 4m’, the 
amplitude M has an imaginary part due to the possibility of pair production by two 
photons in the s channel. We can therefore write for M a dispersion relation in the 
variable s: 


- a Ais s', t) j 
M(s, 2 | ee eee ds’, (126.5) 
4m? 
where A,,;(s, t) denotes the imaginary part of M(s, f). 
As in any diagram having the form 
k, q k, 
k; q-k-k, k, 


+ The directions of the external lines as shown in the diagram (126.2) correspond to the s channel. In 
the t channel, lines 1 and 3 are incoming, and so the 4-momenta of the initial photons are k; and — k3. 
The physical regions for photon—photon scattering in the variables s, t, u are the shaded sectors in Fig. 8 
($67). For example, the s channel corresponds to the region s >0, t<0, u <0. 
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Ais(s, t) is calculated by the rule (115.9), replacing the pole factors in the integral 
(126.4) by delta functions: 


iBS(q° a m*)d[(q — ky — ky)’ — m?] 


2A W(t) = Omi? | TE mG Balm 


d‘p: (126.6) 


the integration is taken over the half of q-space in which q°’>0. 

An important further step can be taken by noting that the integral (126.6) has a 
structure (of pole factors) similar to that of the amplitude for a reaction represen- 
ted by a diagram having the form 


k, q~k, k, 


k, q~k, ky 


The analytical properties of A,,(s,t) as a function of t are therefore similar to the 
analytical properties of this amplitude. In particular, the function Aj,(s, t) can 
acquire an imaginary part (as t increases) only if both factors in the denominator 
become zero simultaneously. This will not, however, occur as soon as t reaches the 
value 4m’ which is the threshold for pair production in the t channel. The reason is 
that the presence of the delta functions in the integrand restricts the region of 
integration in q-space, which may be incompatible with the value t =4m’. The 
extent of the region of integration depends on s (the arguments of the delta 
functions contain k, and k,), and therefore so does the limiting value t = t.(s) 
beyond which A,,(s, t) becomes complex. 

In the same way as M(s,t) is expressed in terms of its imaginary part Aj,(s, f) 
by (126.5), the function A,,(s,t) is in turn expressed in terms of A,(s,t)=1im 
A1,;(s, t) by a dispersion relation in the variable t: 


oO 


1 ¢ Adxst') ,, 
Ai.(s, 1) =— AAD att (126.7) 


t-(s) 


If we now substitute (126.7) in (126.5), we get the double dispersion relation or 
Mandelstam representation for the amplitude M(s, t): 


_t As’, t’) ' ! 
M(s,t)=—5 | | oie 4 ds (126.8) 


4m? t,(s) 


(S. Mandelstam, 1958). 

The function A,(s,t) is called the double spectral density of M(s,t). It can be 
obtained from the integral (126.6) by twice applying the substitution rule (115.9). 
Putting for brevity 


h=q h=q-ky, b=q-k, = q-ki—k, (126.9) 
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we have 


(21° A,(s, t) = Qniy' | iB (1? — m?)8(13 — m?)8(13 — m2)8(12— m2) d‘q, 
(126.10) 


the integration being taken over the region q’ > 0. 

It should be noted, however, that formula (126.10) is purely symbolic, since the 
region s > 0, t > 0 1s non-physical, and accordingly I|,, l,... are in general complex 
in this region when q is real; and the delta function is not fully defined for a 
complex argument. It would be more accurate to refer immediately to the taking of 
residues at the corresponding poles of the original integral (126.4). In our case this 
is, however, of no importance. The condition for the four expressions in the 
denominator in (126.4), or the four arguments of the delta functions, to be zero, 
entirely determines the components of the 4-vector qg. On changing to integration 
with respect to Ij, 15,... (see below) and formally applying the usual rules to 
(126.10), we obtain (apart from the sign) the expression for Ab. 

To continue the calculations, we use the centre-of-mass system (in the s 
channel). Then 


ky = (w, k), k» = (w, _ k), k3 = (w, k’), kK, ve (w, = k’), (126.11) 


s=4@*, t=—(k—k'f~ =—4o’ sin’ 56, 


u = —(k+k')’ = — 4m’ cos’ 48, 


(126.12) 


where @ is the angle between k and k’ (the scattering angle). The x-axis of spatial 
Cartesian coordinates is taken along the vector k +k’, and the y-axis along k —k’.f 

We shall now transform the integral (126.10) by taking I7, [,... as new variables 
of integration in place of the four components of q. Then 


a(lplaq" =2h,,..., 
and the Jacobian of the transformation is therefore 


a(1i, 13, 3, 1) 
2 <2s 3 “4 = 16D, 
a(q°, qx, dy, q:) 


where D is the determinant formed by the sixteen components of the four 
4-vectors l,, L,.... The integration in (126.10) amounts simply to replacing the 
functions B and D in the integrand by their valuest when 


=== =m’. (126.13) 


+ When t >0, (k—k’')’<0, i.e. the vector k—k’ is imaginary. This difficulty is, however, easily 
circumvented by expanding all vector expressions with t <0 and using analytical continuation to t > 0. 

¢ This method of integration automatically takes account of only one zero of each argument of the 
delta functions. 
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From the conditions Ij = 1} = m? we have, as in §115, 
qQ=0, G=o~-m. (126.14) 
The other two conditions give 
(q —k,)’ — m? = — 2qk, = —2w’- 2q-k' =0, 
(q —k,)’-— m* = —2w*-2q:k=0, 
and hence 
q:k=q-k'=~4s, 
or, in components, 
Q=o, gr=—s/2%(stt), gy =9, 
qz = + V(w? — m?— qi) 


_ . fst -—4m7(s + t)]? 
a ericess ee a | (126.15) 
Thus the integral (126.10) is 
ar‘ 
Ax(s, t) = Fy D (— iB), (126.16) 


where the summation is over the two values of q given by (126.15). 
The determinant D can be written in terms of the antisymmetric unit tensor: 


D = Cyypol fIZBUG 
= — Cups qQ*kakski 
=— €uvpo(q = k)*(k4 = k 1)" (ky a k\)’k?, 


where the antisymmetry of e,,,, has been used. Since only k; among the four 
factors has a time component, we deduce that 


D = — oq: (k+k’) x (k—k’). 
Expanding this expression with t <0 and then continuing to t > 0, we find 
D =— gq, V(s + t)V(— t) > + 4i{st[st — 4m7(s + t)}}!”. (126.17) 
The choice of sign needed here can be made as follows. For simplicity, let 
B =1. Then A,,(s, t)<0 in the physical region (s > 0, t <0), since the two factors 


in the denominator in (126.6) have the same (negative) sign: 


(q — ks)’ — m* = — 2w*— 2q- kk’ < — 20(@ — |q|) <0, 
(q — ky — m? = — 20? —2q +k <—20(w —|q|) <0 
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(here we use the results (126.14) which follow from the presence of the two delta 
functions in the numerator, and which show that |q| < w).+ From (126.7) it is then 
seen that A.2(s, t) also must be negative when s >0 and t > 0 (since, as is evident 
from (126.16), A2(s, t) does not change sign). This means that the upper sign must 
be taken in (126.17), giving finally 


SB 


Ay=— 1" {st{st —4m7(s + t)]}"" ize.) 


Since, from its significance, A2(s, t) must be real, there is a further condition: as 
well as s and t, the expression in brackets in the denominator must be positive: 


st -4m*(s +t) =0, | 

s>0, t>0. vn 
These inequalities define the region (shaded in Fig. 23) over which the integration is 
to be taken in the double dispersion integral (126.8). The region is bounded by the 
curve 


st -4m7(s +t) =0, 


with asymptotes s = 4m’ and t = 4m’. 

The dispersion relations in the form (126.5) and (126.8) do not yet take account 
of the renormalization conditions; if they were applied as they stand, the integrals 
would be divergent and would need to be regularized. The renormalization con- 
dition for the amplitudes M(s, t) is 


M (0, 0) = 0: (126.20) 
the photon-photon scattering amplitude must be zero when k, = k,;=k;=k,=0 


(and therefore s = t = 0), since k = 0 implies a potential constant in time and space, 
corresponding to no physical field; this condition will be further discussed in §127. 


Fic. 23. 


+ This is, of course, not fortuitous: Aj, is negative, in fact, because of the unitarity condition, as is 
especially clear when t = 0 and Aj, determines the total cross-section. 
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To include this condition automatically, we must write the dispersion relation 
“with subtraction” (as in deriving (111.13) from (111.8)). The required relation is 
obtained in a natural manner by first using an identical transformation: 


1 ue st S t 1 
(s'—s)(t'—t) (s'—s)(t'—t)s't' . (s'—s)s't’ _ (t'—t)s't’ s't” 


Substitution of this in the integrand (126.8) gives 


_ St (s',t')ds'dt' | s f f(s')ds' , t f g(t’) dt’ 
MNS At) = casa essaae ag t)s't’ i (s'—s)s' ae (C=pr © 


where 


_1 +f Als) AAS) yr gty=— [AA as, 


waite 
7 St 


These equations would, however, be meaningful only if all the integrals con- 
verged. If not, the functions f(s), g(t) and the constant C must be assigned 
specified values in accordance with the renormalization condition, putting 


C=0, f(s)=A,,(s,0), g(t) = Ai,(0, ft), 
where A;, is the imaginary part of M(s, t) which appears as t increases for a given 
small s (just as A,, is the imaginary part which appears as s increases for a given 


small t). The first of these equations is obvious: C = M(0,0)=0. The second (and 
similarly the third) follows on comparing the equation 


M(s,0)= = { AS0Ce 


with the single dispersion relation (126.5) written “‘with subtraction” according to 
(126.20): 


MEHj= a cers AiehS el) ye (126.21) 


Thus the double dispersion relation ‘‘with subtraction” is finally 


St (s', a ' 
M(s, t)= od Naan = ai ds’ dt' + 


A Ce Ai (0, t’) 
Fl eae fut pyr ae’ (126.22) 


If s and t are themselves within the region of integration, the integrals (126.21), 
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(126.22) must as usual be taken in the sense 


S>s-Fi0,. tot +10. (126.23) 


§ 127. Photon—photon scattering 


The scattering of light by light (in a vacuum) 1s a specifically quantum- 
electrodynamic process; in classical electrodynamics it does not occur, owing to 
the fact that Maxwell’s equations are linear. t 

In quantum electrodynamics, photon—photon scattering is described as the 
result of the production of a virtual electron—positron pair by the two initial 
photons, followed by the annihilation of the pair into the final photons. The 
amplitude of this process (in the first non-vanishing approximation) is represented 
by six ‘‘square’’ diagrams with every possible relative position of the four external 
ends. These include the diagrams 


k, k, k k k 
\ 7 2N / 4 3N ye 
\ q A \ Pa \ -, 
—k aks 
am ao (127.1) 
FP g—k-k - 
q-k-k, “\ \ - 
a 


and another three which differ from these only in that the internal electron loop is 
traversed in the opposite direction. The contribution of these three diagrams is the 
same as that of the diagrams (127.1), and the total scattering amplitude is therefore 


My = 2(M + M+ M), (127.2) 


where M®, M®) and M“ are the contributions of diagrams (a), (b) and (c). 
According to (64.19) the scattering cross-section 1s 


_ 1 2_do! 
do = rae |Mji| Qo)” (127.3) 


where do’ is the solid-angle element for the direction k’ in the centre-of-mass 
system. The scattering angle in that system is denoted by 0. 


INVARIANT AMPLITUDES 
Writing separately the polarization factors of the four photons, we have Mj in 
the form 
My = etese3*es* Muu} (127.4) 
+ In the limit of low frequencies, this process was first discussed by H. Euler (1936), and in the 


ultra-relativistic case by A. I. Akhiezer (1937). The complete solution is due to R. Karplus and M. Neumann 
(1951). 
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the 4-tensor M,,,, (called the photon-photon scattering tensor) is a function of the 
4-momenta of all the photons. If the arguments of functions are written with the 
signs which correspond to like directions of the external lines in the diagram, it is 
evident from the symmetry of the group of diagrams (127.1) that 


Miurplk1, k, ~ k3, ~~ k4) 


iS symmetrical with respect to any interchange of the four arguments together with 
a simultaneous corresponding interchange of the four suffixes. Because of the 
gauge invariance, the amplitude (127.4) is unchanged when e is replaced by 
e +constant: k. Thus we must have 


Kt Miyoo = KS Miypo = °° = 0. (127.5) 


It is easily deduced from this that, in particular, the expansion of the scattering 
tensor in powers of the 4-momenta k,, k,,... must begin with terms containing 
quaternary products of the components, and certainly 


M,,19(0, 0, 0, 0) = 0. (127.6) 


To determine the actual invariant amplitudes, however, it is desirable to take 
from the start a particular gauge of the polarization 4-vectors e, in which 


ef =(O,e;), e+ =(0,e),.... (127.7) 
Then 
My = Mint ie oe 4 1€ $m, (127.8) 


where Miim 1S a three-dimensional tensor. 

We take as the two independent polarizations for each photon the circular 
polarizations with opposite directions of rotation, i.e. two helical states with 
helicities A = +1. 

The tensor Mx, can then be written 


Mixim = > Maj rgrgnge Pees? Fe Se 9: (127.9) 


AAQA3Aq 


the sixteen quantities M),,,,,,, are functions of s, t and u, and act as invariant 
amplitudes, but they are not all independent. 

The quantities M),,,,,;,, are three-dimensional scalars. Spatial inversion changes 
the sign of the helicities, while the invariant quantities s, t and u remain‘unaltered. 
The condition of P invariance therefore gives the relations 


Miciats; t, u) a M A 1,8A2,-A3, rAAS, t, u). (127.10) 


Time reversal interchanges the initial and final photons without affecting their 
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helicities; s, t and u again remain unaltered. The condition of T invariance 
therefore gives the equation 


My rorgagSs t, u) as Myyrga rls, t, u). (127.11) 


Lastly, one further relation follows from the invariance of the amplitude Mj; under 
the interchange of the two initial or the two final photons. If both interchanges are 
made (k; — k2, k; <> k4), the variables s, t, u are unaltered, and the interchange in 
the polarization indices leads to 


My raga; thu)= Maya rgas@Ss t, U). (127.12) 


It is easy to see that, because of the symmetry properties (127.10)-(127.12), the 
number of independent invariant amplitudes is only five, which may be chosen, for 
example, as 


Mi44+, Mas, M-+-, M,--4,  Mass- 


(the suffixes + and — denoting, for brevity, helicity values + 1 and — 1). 

If one of the amplitudes M,,,,,,,, is substituted for My, in (127.3), the result is the 
cross-section for scattering with specified polarizations of the initial and final 
photons. The cross-section summed over the final polarizations and averaged over 
the initial polarizations is obtained by the substitution 


|Myi|* > 4{2|Mi4+,|? + 2)/M.4--)? + 2/M,_,-/? + 
+2|M,__4|? + 8)M.44-)}. (127.13) 


The symmetry relations (127.10)-(127.12) connect different invariant amplitudes 
as functions of the same variables. Further functional relations are obtained from 
crossing invariance (§78), since the amplitude M,; describes the same reaction 
(photon—photon scattering) in every channel, and therefore must be the same for 
every channel. 

The s channel (corresponding to the arrow directions as shown in the diagrams 
(127.1)) is converted to the t channel by interchanging the 4-momenta k, and — k; 
(i.e. by changing the variables s<t) and interchanging the helicity suffixes 
A. <>~ Aj. Similarly, it is converted to the u channel by interchanging k, and — k, 
(s << u) and Ax<— — Ay. This leads to the relations 


M,_-+-(s, [, u) = M.+44(u, t, S), 
M,__.(s, t, w) = Mo44+(t, S, u), (127.14) 
M,444(S, t, uw) = My444(s, u, £); 


M..-_ and M,,,- are completely symmetrical in s, t and u.f It is therefore sufficient 


+ Here we have also used the symmetry with respect to the two final photons. Since the three 
variables s, t and u are not independent, it would be sufficient to write two arguments (for example, the 
first two), but we have retained all three, simply in order to clarify the symmetry of the interchanges. 
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to calculate only three of the sixteen amplitudes, for instance M,,,,, M,,-— and 
M,...4-. 

The relations (127.10)-(127.12) and (127.14) apply to the total amplitudes, i.e. the 
sums of the contributions of all three diagrams (127.1). But these contributions 
themselves are related in a manner which is obvious on comparing the diagrams. 
For example, diagram (b) is obtained from diagram (a) by the substitutions 
k,<> —ky, e@:<> ef, and so their contributions to the invariant amplitudes are 
obtained from each other by interchanging the variables s <u and the suffixes 
A2<>~—A4; similarly, the contribution of diagram (c) is obtained from that of 
diagram (a) by the changes t ~ u, A3 << — Ag. 


CALCULATION OF THE AMPLITUDES 


The integral M‘ corresponding to the diagram (127.1a) has the form (126.4), 
with 


4 
B® = “tr {ye(yq ~ yk2 + m)(yer) x 
xX (yq +m)(yes)(yq — yks + m)(yet)(yq — yki— yk2+m)}. (127.15) 


The integrals (126.4) are logarithmically divergent. In accordance with the 
condition (127.6), they are regularized by subtracting the value when k, = k,=---= 
0.¢ The calculation of the regularized integrals is, however, exceedingly laborious. 

The most straightforward way to calculate the photon—photon scattering am- 
plitudes is based on the use of the double dispersion relation (B. De Tollis, 1964). 
This method makes the most complete allowance for the symmetry of the 
diagrams, and almost entirely eliminates the difficulties of the integrations. 

The function A‘?(s, t) (and similarly A$?) for any given set of helicities A1, A», 
d3, Aq IS Calculated in accordance with (126.6); owing to the presence of two delta 
functions in the integrand, the value of B“™ is needed only for 


j=q@=m’, 14=(q—-—ki:—k.’ =m’. (127.16) 
These equations can be utilized in calculating the trace (127.15). For substitution in 


+ Inthe summation of the contributions from all the diagrams, the divergent parts of the integrals cancel, 
as is easily seen by noting the asymptotic form of the integral as q >=: 


Min % | telaCyamlvarncraeOah atalca’y. 
After averaging over the directions of q (cf. (131.10)), the trace is easily calculated, giving 
MY XK (Qrukrp + QrvBup — 2 2rp8yur) | d*qi(q’)’. 


The summation over the diagrams is equivalent to symmetrizing this expression with respect to the suffixes 
A, p, v and p, the result of which is zero. However, this is in a certain sense fortuitous, and does not remove 
the need for regularization, even though the latter amounts only to the subtraction of a finite quantity. 
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(126.22) we need the value of A{® only for t = 0. This implies that k = k’ and k, = ky. 
Then the integral (126.6) becomes 


Ae _ s~ im | BO dos... 
s(S, 0)= —tay/ s [(q _ ky - m’*]”’ (127.17) 
cf. the derivation of (115.10). With the angle & between q and k, we have 


(q —k,)’ — m’? = —2@(1— |q| cos 8) 
= —Vs[1-3V(s — 4m’) cos 8]. 


The integrals (127.17) can in fact be expressed in terms of elementary functions. 
The calculation of A$(s, t) from its definition (126.18) involves no integration; here 
the expression for B“™ is to be taken for the values of q given by (126.15), which 
satisfy not only (127.16) but also the conditions (q — k,)’ = m’, (q — ks)’ = m’. 
When the functions A,,, A;, and A, have been calculated, the dispersion relation 
(126.22) gives the amplitude directly as single and double definite integrals. We 
shall give the final result for the three invariant amplitudes which, according to the 
preceding discussion, are sufficient to determine all the other amplitudes:t 


soa Mies aot (2+) Bie) (2+) Bqw)- 


- [2 -! eb: 8 


= |er()+ T(u)] +2 (1-26, t) + 


4 


+4 (1-2)16, 0+ PE 16 4 4 
u S S 


g 
MG ae = = [He u), (127.18) 


a M..,-=1+ 4(< +5 +2 \[T(s)+ T(t) + T(u)] - 


{2 1,2 ee 
— (+ SOs O-4(F + Sy Os w)—4(5 + 7G wd, 
I asf pa ens _8 
Ba? Mi = 1 st I(s, t) Su I(s, u) tu I(t, u). 


Here, B(s), T(s) and I(s, t) denote the functions 


B(s)= Vi —4/s) sinh '3V —s—-1, s <0, 
T(s) =(sinh '3V—s), s <0, 


I(s, 1) = | sp Bamapatoelt—H0- 911-91 Hetty 


(127.19) 


+ Some further details of the transformations of the integrals, various representations of the 
transcendental functions B, T and I, and some limiting forms are given by B. De Tollis, Nuovo Cimento [10] 
32, 757, 1964; 35, 1182, 1965; V. Costantini, B. De Tollis and G. Pistoni, ibid. [11] 2A, 733, 1971. 


§127 Photon—Photon Scattering 571 


and the expressions in the ranges 0< s <4 and s >4 are obtained from (127.19) by 
analytical continuation with the rule s > s + i0, 1.e. through the upper half-plane of 
these variables. To simplify the notation, s and t denote s/m* and t/m’, in (127.18) 
and (127.19) only. 


SCATTERING CROSS-SECTION 

The limiting case of low frequencies (w <m) corresponds to small values of the 
variables s, t and u. The first terms in the expansion of the invariant amplitudes in 

powers of these variables are 

Mii++ = Lle*s’/45m‘*, My, ~ lle*t?/45m‘*, 
M,..-~ lle*u?/45m*, M..--=—-e(s?+t?+u)/15m‘, (127.20) 
M,44-~ 0. 

Substituting these values in (127.3), we find the cross-sections for the scattering of 


polarized photons. The differential scattering cross-section for unpolarized pho- 
tons, calculated from (127.13), is (in ordinary units) 


Ae 139 ) (a2 


6 
ont ts ~,) (3+cos?@) do! (127.21) 


and the total cross-section ist 


Oo 


Se *r2(42,) 
101257 °° °\me? 


6 
= 0.031a°r2( 25) , ho <mc’. (127.22) 


In the opposite (ultra-relativistic) case, the total scattering cross-section for 
unpolarized. photons ist 


a =4.7Ja‘(clw), he > mc’. (127.23) 


Finally, the differential cross-section for small-angle scattering in the ultra- 
relativistic case is 


a 


do = 
7 


4,2 
on log* a do, mc’|ha < 6 < 1. (127.24) 


This expression is valid with logarithmic accuracy (the next term in the expansion 
contains a power of the large logarithm lower by one unit). In the limit 6 =0 


+ In going from do to o, a factor + has to be included to take account of the identity of the two final 
photons. 
+t The origin of this dependence of o on w will be further discussed at the end of §134. 
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Ko/me? 
Fia. 24. 
(forward scattering), (127.24) is invalid, and is replaced by 
4.2 
do = aa log! do, 0<mc’|ho. (127.25) 


This expression is easily derived from the general formulae (127.18), putting t =0 
and noting that, for s > 1, the highest power (the square) of the large logarithm is 
present only in the function 


T (s/m?) =~ 4 log’ (s/m?) ~ log’ (w/m). 
To this accuracy, the only non-zero amplitudes are 
M444 =M_ = UV ists pee 16e° log’ (w/m). 


In particular, therefore, the photon polarization is in this case unchanged on 
scattering. 

Figure 24 shows the total scattering cross-section as a function of the 
frequency, plotted on a double logarithmic scale. The cross-section decreases 
towards both low and high frequencies, reaching a maximum when hw ~ 1.5mc’. 
The break in the curve at hw = mc’ corresponds to the change in the nature of the 
process when the production of a real electron—positron pair becomes possible. 


LOW FREQUENCIES 


For low frequencies (w <m), the photon—photon scattering amplitude can also 
be derived by a totally different method, based on the correction terms in the 
Lagrangian of a weak electromagnetic field ($129). 

The small correction V’ to the interaction Hamiltonian differs only in sign from 
that to the Lagrangian. From (129.21), 


4 
Vi=- ae | {(E? — H’) + 7(E - HY} d?x. (127.26) 
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Since this operator is of the fourth order in the field, it has matrix elements for the 
relevant transition, even in the first approximation. 
For the calculation, we substitute in (127.26) 


=~—dA/at, H=curlA, 


E 
A=V(4ir) >, (xr ex, @ + EX ef, e!), (127.27) 
k,A 


where A numbers the polarization; the S-matrix element is then given by 


— i¢fl{ V’ dtli) 


= ~ i |Ciay Chars | V' dt cin, cié,,|0) (127.28) 


(cf. §§72 and 77). When A is normalized as in (127.27), the scattering amplitude My; 
is found immediately from Sj: 


Sp = i 27 bO(ks + ka — ki — ko) My (127.29) 


(cf. §64). The mean value in (127.28) is calculated by means of Wick’s theorem, 
using (77.3), with contraction of only the “external”? operators ¢,,, €x, with the 
internal operators A. 


§ 128. Coherent scattering of a photon in the field of a 
nucleus 


Other effects which are non-linear, like photon—photon scattering, and are 
described by square diagrams of the form (127.1), are the disintegration of one 
photon into two in an external field (and the reverse process of combination of two 
photons into one), and photon scattering in an external field. The former cor- 
responds to diagrams in which one of the four external photon lines is replaced by 
an external field line; the latter process corresponds to diagrams with two external 
lines of real photons and two of virtual photons. 

This class includes, in particular, coherent (elastic) scattering of a photon in the 
constant electric field of a stationary nucleus. In general, the calculations lead to 
very lengthy formulae involving multiple quadratures.+ Here, only some estimates 
will be given. 

Because of the requirements of gauge invariance, the scattering amplitude as 
w—>0 must contain products of the components of the 4-momenta of the initial 
photon (k) and the final photon (k’), just as the expansion of the photon—photon 
scattering amplitude begins with the quaternary products of the components of the 
4-momenta of all the photons. Thus the scattering amplitude for a low-frequency 
photon is proportional to w’. Since also this amplitude involves the external field 


+ See V. Costantini, B. De Tollis and G. Pistoni, Nuovo Cimento [11] 2A, 733, 1971; B. De Tollis, M. 
Lusignoli and G. Pistoni, ibid. 32A, 227, 1976. 
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(the field of the nucleus with charge Ze) in the second order, we conclude that the 
scattering cross-section is 


do ~ Z'a‘r(@/m)i do (@ <m). (128.1) 


The frequency dependence is, of course, in agreement with the general results of 
859. 

The coefficient in (128.1) cannot be calculated from the Lagrangian for a 
uniform electromagnetic field (as was done for photon—photon scattering). The 
reason is that, in the process here considered, distances from the nucleus r ~ 1/m 
at which its field cannot be regarded as uniform are important. 

The result of the exact calculation is 


do. = do__ = 1.004 x 107(Za)*r2(w/m)y* cos* 36 do, 


(128.2) 
do, = do_, = 3.81 x 10°“(Za)*r(w/m)* sin‘ 36 do. 
Here, as in $127, the suffixes + and — denote the helicities + 1 and — 1 of the final 
and initial photons; @ is the scattering angle in the rest frame of the nucleus (V. 
Costantini, B. De Tollis and G. Pistoni, 1971). 

To estimate the cross-section at high frequencies, we use the optical theorem 
($71). The intermediate state which appears on the right-hand side of the unitarity 
relation is here a state of the electron—positron pair (corresponding to the division 
of the diagrams at two internal electron lines between external photon lines). The 
optical theorem therefore relates the amplitude for elastic scattering of a photon 
through an angle of zero and the total cross-section o,,i, for photon pair production 
in the field of the nucleus. If the amplitude f(w, 6) for scattering through an angle 6 
is so defined that the scattering cross-section is do = |f|? do (cf. (71.5)), we have 


im f(w, 0) = wopair/ 477. 


The cross-section Gpair 1S, of course, zero unless wm >2m. In the ultra-relativistic 
case, taking Opa, from (94.6), we get 


f"(w) = 1m f(a, 0) = 5 (Zay i © log F | (w >m). (128.3) 


The real part of the scattering amplitude is determined by the imaginary part, 
through the dispersion relation. The latter must here be written ‘with one subtrac- 
tion’’, i.e. for the function f/t (where t = w’), since as w >0 the amplitude f « w’; 
compare the dispersion relation ‘‘with two subtractions” (111.13). Separating the 
real part of the dispersion integral (for which it is sufficient to take the integral as a 
principal value), and changing from integration with respect to t'= w"” to that with 
respect to w’, we have 


f'(w') da' 


f'(@) =re flo, o=teln | Ligpaes So®— oF) (128.4) 
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When w >™m, the important values in the integral are w’~ w >™m, so that we can 
use the expression (128.3) for f"(w’); the lower limit of integration may then be 
replaced by zero. The principal value of the integral can be represented as half the 
sum of the integrals along paths on the upper and lower edges of the positive real 
axis in the complex w’-plane; these paths may then in turn be rotated in the 
w'-plane to lie along the positive and negative imaginary axes respectively. Then 


wo? f £GE+L(— i) a 
7 


2, de 2 Ve Z| dé 
= io (48 m E+’ 
0 
and the final result is 
() 2 
re f(w, 0) = 18 (Za) rew/m. (128.5) 


Note that the real part of the amplitude, unlike the imaginary part, does not contain 
a large logarithm. 

The sum of the squares of (128.3) and (128.5) gives the cross-section for 
scattering through an angle of zero as 


Zz 
doy. = goa (Za) n(2) {loe’ C2) +407} do (128.6) 
(F. Rohrlich and R. L. Gluckstern, 1952). 

The result (128.6) derived for scattering exactly forwards is valid also over a 
certain range of small angles. The condition for its validity can be shown to be 
6<(m/w)’. This range, however, makes only a small contribution to the total 
scattering cross-section. The main contribution to the latter comes from angles 
0<ml/qa, as is easily seen from the general (not only for angle zero) unitarity 
relation between the amplitudes for photon—photon scattering and photon pair 
production. In that range, however, the logarithmic term is absent, and the total 
scattering cross-section is thus 


o ~ (Za) r2(@/mye? ~ (Za) r2 (128.7) 


(H. A. Bethe and F. Rohrlich, 1952). For large w, therefore, the coherent scattering 
cross-section tends to a constant limit. 


§129. Radiative corrections to the electromagnetic fieid 
equations 


In the quantization of the electron—positron field ($25) it has been shown that 
the expression for the vacuum energy contains an infinite constant, which may be 


576 Radiative Corrections §129 


writtent 
é0=—D, eno» (129.1) 


where — e(?) are the negative frequencies of the solutions of Dirac’s equation. This 
constant itself has no physical meaning, since the vacuum energy is, by definition, 
zero. When an electromagnetic field is present, however, the energy levels (> will 
change. The changes are finite and are physically significant. They describe the field 
dependence of the properties of space, and alter the equations of the electromag- 
netic field in a vacuum. 

The changes in the field equations correspond to the change in the field 
Lagrangian. The density L of the Lagrangian is a relativistic invariant, and 


therefore can depend only on the invariants E’ — H’ and E- H. The usual expression 
Lo = (E’ — H’)/87 (129.2) 


is the first term in the expansion of the general expression in powers of the 
invariants. 

Let us derive the Lagrangian for the case where the fields E and H vary so 
slowly in space and time that they can be regarded as uniform and constant. Then 
L may be assumed not to involve the derivatives of the fields. The necessary 
conditions for this will be discussed at the end of the section. 

However, if the problem stated is to be meaningful, we must also assume the 
electric field to be sufficiently weak. The reason is that a uniform electric field can 
generate pairs from the vacuum. The field itself can be treated as a closed system 
only if the pair production probability is sufficiently small: 


IE|<m/’/je| (=m/’c’/lelh), (129.3) 


i.e. the change in the energy of a charge e over a distance h/mc must be small in 
comparison with mc’. We shall see below (cf. also Problem 2) that the pair 
production probability is then exponentially small. 

If there is a magnetic field as well as an electric field, it is in general possible to 
choose a frame of reference in which E and H are parallel. Then the magnetic field 
does not influence the motion of the charge in the direction of E. The condition 
(129.3) is to be satisfied in this frame, which will be the one used in the subsequent 
calculations. 

The calculation of the Lagrangian begins with that of the change W’ in the 
vacuum energy. This is given by the change in the “‘zero energy” (129.1) due to the 
field. From this, however, we must subtract the mean values of the potential energy 
of the electrons in the “‘states”’ of negative energy. The subtraction simply makes 
the total charge of the vacuum zero by definition. 

The zero energy in the presence of the field is 


. ee ee 
Bo=—Sew= > | whi swe a, (129.4) 
p.c 37 


+ Here we shall write € in place of E, to avoid confusion with the electric field. 
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where wS) are the negative-frequency solutions of Dirac’s equation in the field 
concerned. We shall assume that the integration is over a unit volume, and that the 
wave functions are normalized to unity in that volume; then & is the energy per 
unit volume. According to the preceding discussion, we have to subtract from €p 
the quantity 


Us= >) | Who* ediy) d’x, 
p,e 


where ¢ = —E-r is the potential of the uniform field. According to the theorem on 
the differentiation of an operator with respect to a parameter (see QM, (11.16)), 


Uy=E- py pie ot Woo d°x 
=—E- > deQ/aE 
Do 
=E- d6)/dE. 
Thus the total change in the vacuum energy density is 
W'= (6 )— E- 06 /0E) — (6) — E+ 0€0/0E)g-1-0. (129.5) 


We can relate W’ to the change L’ in the Lagrangian density (L = L)+ L’) by 
using the general formula 


W =>) qaL/aq-L, 


where q represents the “‘generalized coordinates”’ of the field (see Fields, 832). For 
an electromagnetic field, the quantities q are the potentials A and @. Since 


E=—A-Vod, H=curlA, (129.6) 


A is the only “‘velocity’’ q which appears in L, and the differentiation with respect 
to A is equivalent to one with respect to E; hence 


W'=E- dL'/aE-L'. (129.7) 
Comparison of (129.5) and (129.7) gives 


L' = —[@o — (@o)g=n=0l.- (129.8) 


Thus L' can be calculated by means of the sum (129.1). 
Let us first take the case where there is only a magnetic field. The “‘negative”’ 
energy levels of the electron (charge e = —|e|) in a constant uniform field H, = H 
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are 
—¢9=—V[m’+le/H(2Qn-1+0)+p?], (129.9) 
n=0,1,2,...;0=1 


(see §32, Problem). To find the sum, we note that the number of states in the 
interval dp, is 


|e|H dp. 


2a 2a 


(see QM, $112); the first factor is the number of states with various values of p,, 
which do not affect the energy. Moreover, all the levels except n =0, o =—1 are 
doubly degenerate, the levels n, o=+1 andn+1,o=—1 coinciding. Hence 


Eo= - ee FF vem? piy+2 5 von? +2leHn +p} dp, (129.10) 
n=] 


—o 


The divergence of the integrals in (129.10) is eliminated in the calculation of L’ 
(129.8) by subtracting the value of the sum when H =O. To carry out this 
‘renormalization’, it is convenient to calculate first the convergent expression 


d°S%, 
(am*) 


= 30RF, {(m? + p24 2 >) (m’ + 2le|Hn + py dp 
n=1 


: ae SEE Never rea 


The sum in the braces can be reduced to that of a geometrical progression, as 
follows: 


T 


o=—lB fem [2 ye 2e|Finy 1] dn 
0 


e|H nt 2 
--fe : iF v1] dn 
0 


ice) 


= ar e~™" coth (le|Hn) dn. (129.11) 


0 


To find L', we must now integrate ® twice with respect to m’ and then subtract the 
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value of the resulting quantity when H = 0. This gives 


—m2n 
gz | Go tnlelH coth (nle|H)~ 1} dn + e+ com’, (129.12) 


where c,; and c, depend on H but not on m’. 
From considerations of dimensions and of parity with respect to H, it is evident 
that L’ as a function of H and m must have the form 


L' = m*f(H’/m‘). 


Hence there can be no terms in L’ which are odd in m?’, and so c.=0. The 
coefficient c; is given by the condition that the expansion of L' in powers of H’ 
begins with a term in H*: a term in H’ would simply alter the coefficient in the 
original Lagrangian Ly) = — H’/87, and this would essentially signify a changed 
definition of the field and therefore of the charge. The elimination of the H’ terms 
thus corresponds to a renormalization of charge. It is easily verified that this is 
achieved by putting 


H’e’ { e” 


“3x8n J on OT 
0 


1 


Finally, making the change of variable m’y > 7 in (129.12), we have 
m* r d 
L'(H; E=0)= 872 | {— yb coth by +14+3b’n7e" per (129.13): 
0 


where b = |e|H/m’. 

Let us now go to the general case where there is not only a magnetic field but 
also an electric field E parallel to it, satisfying the condition (129.3). 

To find L' in this case it is not necessary to determine afresh the energy levels 
es of the electron in the field; we need only note that, if the wave function (the 
solution of the second-order equation (32.7)) is sought as a product 


= We(z) e?* Xno(Y), 
where ync(y) is the wave function in the magnetic field when E = 0 and p, = 0, then 
the mass m and the field H appear in the equation for we(z) only in the 
combination 


m’+|e|H(2n+1+ 0). 


If now the factor |e|H/27 is again taken from the summation over p, (the energy 
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levels being independent of p,), the dimensional argument shows that 


®(H, E) = a°L'|(am’y 


_o eH ss F([m* + je|H(2n + 1+ 0)]/|e|H) 


Sa a m’ + |e|H(2n+1+¢) 
b 1 “ F(1 + 2bn/a) en) 
nla 
=-g{F(q)+2), 1+ 2bn \ 
a =|e|E/m’; 


each term in the sum is — d*e$?/(dm*)’ summed over all the quantum numbers 
except n. Here F is a function as yet unknown, which will be derived from 
considerations of relativistic invariance. 

® must be a function of the scalars H?— E* and (EH) = (E- H)’: 


®(H, E) = f(H’— E’, (EH)’). 
Hence 
@(0, E) = f(— E’, 0) = ®GE, 0). 


The function ®(iE, 0) is obtained from (129.11) by putting H > iE; after a change of 
notation for the variable of integration, this gives 


(iE, 0) = a | 6 VOCS Hay: (129.15) 
0 


The function F can be found by comparing this expression with the limit ®(H > 
0, E) given by (129.14). The passage to the limit H > 0 in (129.14) can be effected 
by replacing the summation over n by integration over dn = dx/2b: 


1 1+x\ dx 
00, E)=-ga| F( ‘i ee 
0 
_ 1 f FY) 
= ga | dy. (129.16) 


Equating the expressions (129.15) and (129.16) and differentiating with respect to 
l/a =z, we find 


[e,¢) 


F(z)/z = -| e™ » cot n dn. 


0 


The summation in (129.14) then reduces again to the summation of a geometric 
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progression, and the subsequent calculations are similar to those given previously: 
we express ® in terms of m’, E and H, integrate twice with respect to m’, subtract 
the value for E = H =0, and determine the constants of integration as in the 
derivation of (129.13). The final result ist 


4 r “7 
Lie z i eu {—(na cot na)(nb coth nb) + 1—37(a?— b’)} dn, 
0 


(129.17) 
a = |e|E/m? (= |e|hE/m’c*), — b = |e |H/m? (= |e|hH/m’c’). 
The parameters a and b may be written in the invariant form 
a=— gel {(F + iG)" —(F — iG)"}, 
(129.18) 
b= Hh {(F + iG)!" + (F — iG)""}, 
where # and & denote the invariants 
F=(W-E’), G=E-H, F+iG=i(H+iEY. (129.19) 


When (129.17) is expressed in terms of the invariants ¥ and G, it becomes 
applicable in any frame of reference (not only in that where E||H). 

The formula (129.17) is written in a somewhat arbitrary manner. It is valid only 
if the electric field is small: a <1 (129.3); this condition is not shown explicitly in 
(129.17), but can be seen from the fact that the integrand in (129.17) has poles at 
yn =nnla (n=1,2,...), and the integral as written above has, strictly speaking, no 
meaning. Hence (129.17) can essentially be used only to derive the terms of the 
asymptotic series (see below) in powers of a by a formal expansion of cot a. 

The integral (129.17) can be given a mathematical meaning by passing round the 
poles in the complex 7-plane. Then L’, and therefore the energy density W’, have 
an imaginary part. Since the energy is complex, there are quasi-stationary states. 
In the present case, the stationarity condition is violated by pair production, and 
—2im W' is the probability w of pair production per unit volume and time; since 
the small increments of W and L differ only in sign, the probability w, expressed in 
terms of E and H, is simply 


w=2im L’. (129.20) 


This is clearly proportional to e~”*“ (see (129.22) below). Because im W’ is 
exponentially small when a <1, an asymptotic series in powers of a, retaining any 
finite number of terms, is meaningful. 


+ This was first derived by W. Heisenberg and H. Euler (1935). The analysis given above makes use 
also of the principles of a proof suggested by V. F. Weisskopf (1936). 

+ The direction of passage round the poles must be chosen so that im W’ <0; this corresponds to 
the usual rule m?—> m? — iO (i.e. here a> a + iO). 
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Let us consider the limiting cases of formula (129.17). In weak fields (a <1, 
b <1), the leading terms of the expansion are 


4 2_ py? eas b 2 4 
pi re () = a5xRghyt t+ 79"). (129.21) 


In particular, when b = 0 the relative correction 1s 
1 16 a aa’/457. 
The imaginary part of L’ for a <1 is obtained from the integral (129.17) by 
taking half the residue at the pole of the cotangent nearest to the origin, i.e. at 


na = 7 —i0. From (129.20), this gives the probability of pair production by a weak 
electric field: 


Se aa 
w= ae 
or, in ordinary units, 
1 /eEh \* mc? (mc\? mc? 
a (a3) eu. xP G TelRE ) (lenee) 


In a strong magnetic field (a =0, b> 1), we start from (129.13), written (with 
bn > 7) as 


| ew ; n coth yn — 1 
L'= ps SOE Ty. 
ps n 3 n 7) 


When b > 1, the important range in this integral is 1 < y <b, in which e-”” ~ 1 and 
we can neglect the second term in the brackets, terminating the range of integration 
(with logarithmic accuracy) at ny ~ 1 and y = b. Then 


L' = (m‘b?/2477’) log b; (129.23) 
in a more exact result, log b becomes log b — 2.29. The ratio L’/Lo is here 
L'/Lo = (a/377) log b, 


from which we see that the radiative corrections to the field equations may become 
of relative order unity only in exponentially strong fields: 


H ~(m’/le)e*™™. (129.24) 
The corrections calculated above are, nevertheless, meaningful: they remove 


the linearity of Maxwell’s equations, and thus lead to effects which are in principle 
observable (e.g. scattering of light by light or in an external field). 
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The relation between the fields E and H and the potentials A and @ remains, by 
definition, as before (129.6), and there is therefore also no change in the first pair of 
Maxwell’s equations: 


dre. <a S = (129.25) 


The second pair of equations are obtained by varying the action 


S =| (Lo+ L') d*x 


with respect to A and ¢, and can be written 


curl (H— 47M) = <= (E+ 47P), (129.26) 
div (E+ 47P) = 0, (129.27) 

with the notation 
P=<aL'/aE, M=4L'/aH. (129.28) 


Equations (129.25)-(129.27) agree in form with the macroscopic Maxwell’s equa- 
tions for a field in matter.t Hence we see that P and M signify the electric and 
magnetic polarization vectors of the vacuum. 

Note that P and M are zero for the field of a plane wave, where both invariants 
E’ — H’ and E- H are zero. For a plane wave, therefore, the non-linear corrections 
are Zero in a vacuum. 

Lastly, let us consider the conditions for the above formulae to be valid. If the 
fields are to be regarded as constant, their relative changes over distances or times 
of the order of 1/m must be small; this ensures that the corrections to Lo arising 
from the derivatives are small in comparison with Ly itself. For instance, if the field 
is only time-dependent, this gives the obvious condition 


w <m. (129.29) 


For a weak field, however, there is also a more stringent condition. This occurs 
because the fourth-order term (129.21) must be much larger than the correction to 
Ly) quadratic in the derivatives; otherwise, the fourth-order term would have no 
meaning. For example, in an electric field depending only on the time, this leads to 
the condition 


w <mle|E/m?, (129.30) 


which is more stringent than (129.29). 


+ In making the comparison, it must be remembered that in macroscopic electrodynamics the mean 
value of the magnetic field is denoted by B, not by H as here. 
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The condition (129.30) does not arise, however, in the problem of photon— 
photon scattering considered in the last part of 8128. There, we are concerned from 
the start with a four-photon process only, described by the fourth-order terms in 
the Lagrangian, and the relative magnitude of the other terms in L’ ts irrelevant. It 
is therefore sufficient if the condition (129.29) is satisfied. 


PROBLEMS 


PROBLEM |. Determine the correction to the field of a small stationary charge e; due to the non-linearity 
of Maxwell’s equations. 
SOLUTION. For H = 0, (129.21) gives 


P= 0dL'/dE 
= (a7/907°m*) E’E. (1) 
In the case of central symmetry, (129.27) gives 
(E+ 4aP)r’ = constant = e1, (2) 


the value of the constant being obtained from the condition that as r>~© the field is the Coulomb field of 
the charge e;. An approximate solution of (2) is 


E =(e lr’) — 2a’ei/45am‘r’*), 
or 


® = (e,/r)(1 — 207e7/225 amr’). (3) 


The correction in (3) that is non-linear in e; is to be distinguished from the linear correction in 
(114.6), due ultimately to the non-uniformity of the Coulomb field. The correction (3) is of a higher order 
in a, but decreases more slowly with increasing distance and increases more rapidly with e). 

PROBLEM 2. Estimate directly the probability of pair production in a weak uniform constant electric 
field in the quasi-classical approximation, to exponential accuracy (F. Sauter, 1931). 

SOLUTION. The motion is quasi-classical in a weak field E (which has a slowly varying potential 
o@ = —-E-r=-— Ez). Since the reaction amplitude contains the wave function of the final positron as the 
initial ““negative-frequency” function, pair production may be regarded as a transition of an electron 
from a “‘negative-frequency” to a “‘positive-frequency” state. In the former state, with the field present, 
the quasi-classical momentum is determined by the equation 


e =~ V[p(z)+m’] + lel Ez, (1) 
and in the latter state by 
e =+V[p*(z) +m’) + |e| Ez. (2) 
The change from the first to the second state implies a passage through a potential barrier (the region of 
imaginary p(z)), which separates the regions where the functions (1) and (2) apply with real p(z) for a 
given e. The boundaries z; and z2 of this barrier occur at p(z) =0, Le. 
e=-—mt|elEx, e=+mtlel Ez. 


The probability of passage through a quasi-classical barrier is 


w * exp (-2 f |p(z)I dz ) 


= exp (am Fva—e) dé), 
0 
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whence 
w <exp(—am’/le| E), 


in agreement with (129.22). 


§ 130. Photon splitting in a magnetic field 


The non-linear corrections in the electromagnetic field equations give rise to a 
number of specific effects in photon propagation in external fields. 

In order to put these equations in a more familiar form (cf. the last footnote), 
we shall denote the electric and magnetic fields in this section by E and B; D and H 
will denote the quantities 


D=E+47P, H=B-47M, P=d0L'/dE, M= 0L'/oB. 
Equations (129.25)-(129.27) then become 


div B = 0, | (130.1) 


divD=0, curl H = dD/ot. 
Let us consider photon propagation in a constant uniform magnetic field Bo. 


Denoting by a prime the quantities which relate to the weak field of the elec- 
tromagnetic wave, we have for these the equations 


kX H’=—-oD’', kXE’'=oB, 
(130.2) 
k-B’=0, k-D’'=0, 
with 
Di = x E., Bi = pas; (130.3) 


the vacuum permittivity and permeability tensors are functions of the external field 
By. Assuming this field to be so weak that |e|Bolm? <1, we find from the Lagrangian 
(129.21) 


2 4 
cin = Su t+ geena BUC — Sn +3 bibs), 

a (130.4) 
pir = 8x + Zea BO(Su + 2bibs), 


where b = Bo/ Bo. 

The photon frequency is assumed so small that wo <m (129.29). However, the 
structure of the tensors ¢ and ux does not depend on this assumption; it follows 
from the invariance of quantum electrodynamics under spatial inversion and charge 
conjugation. The first of these prevents the occurrence in D’ of terms having the 
form constant x B’ or constant < Bo(Bo - B’) (since inversion changes the sign of E 
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and D but leaves H and B unchanged); the second prevents the occurrence in cx 
and pi. of terms antisymmetric and odd in Bo, of the form eBo (since charge 
conjugation changes the sign of all fields). 

Since the problem under consideration has a distinctive plane, namely the 
kb-plane, it is reasonable to take the linear polarizations in and normal to this plane 
as the two independent polarizations of the photon. The subscripts 1 and || will 
denote polarizations in which the vector B’ is respectively perpendicular to the 
kb-plane and in that plane. 

For perpendicular polarization, the vector H' is, like B’, at right angles to the 
kb-plane: 


The vectors E’ and D’ are in that plane. Then, from equations (130.2), we obtain the 
photon dispersion relation k = n,w, with the “refractive index” (in ordinary units) 


4 
n= 1+, B?2 sin? 0, (130.5) 


where 0 is the angle between k and Bo.t 
In the second case, B’ and H’ are in the kb-plane, E’ and D’ perpendicular to it. 
The refractive index is found to be 


4 


eT - B? sin? 6. (130.6) 


Note that n, 2 nj. The equality occurs when 0 = 0, n, = n= 1. 

The most interesting manifestation of the non-linearity of Maxwell’s equations 
with radiative corrections is the splitting of a photon into two in an external 
magnetic field (S. L. Adler, J. N. Bahcall, C. G. Callan and M. N. Rosenbluth, 
1970). 

In a constant uniform field, this process occurs with conservation of energy and 
momentum.¢ In the decay of a photon k into photons k; and k,, we have 


w(k) = w(k;) + w(k2), kj +k. =k. (130.7) 


For photons in a vacuum, in the absence of external fields, w = k and the equations 
(130.7) can be satisfied only for three photons moving in the same direction. In that 
case, however, the decay is rigorously forbidden by the invariance under charge 
conjugation: Furry’s theorem (§79) shows that the sum of diagrams with three 
photon free ends is zero. 


+ Expressing B’ in terms of H’ in the second equation (130.2), we substitute H’ from there in the first 
equation, and then take the projection of the latter on the direction of b. The product k - FE’ is expressed 
in terms of b- E’ by means of the equation k- D’ = 0. 

+ The conservation of momentum is due to the spatial uniformity of the field, but of course occurs 
only for processes involving uncharged particles. The Lagrangian for charged particles contains not only 
the fields but also the field potentials, which depend on the coordinates even in a uniform field. 
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The presence of an external field makes the decay of the photon possible; this 
decay is represented by diagrams with three photon ends and one or more 
external-field lines. The possibility is, however, dependent on the nature of the 
photon polarization. The dependence may be deduced from the conservation laws 
(130.7) and the change in the photon dispersion relation in a magnetic field. 

The dispersion relation may be written 


w =k + B(k), (130.8) 


where B(k) is an increment that is small (in a weak field). Its presence makes 
possible, in principle, the fulfilment of equations (130.7) for momenta k, and k, lying 
in a certain narrow cone near the direction of k. Since the directions of all three 
vectors k, k,, k, are close together, they can all be regarded as parallel to k in the 
small terms B(k), and we can take k;+k,=k. The law of conservation of energy 
then becomes 


B(«k) — By (ek) — Boek — ek,) = ky + |k- k,|—k 


(where x =k/k); since the dispersion relation depends on the polarization of the 
photon, the functions B, B;, B2 may be different. Since 


kk, 


a = oe: _ aoe ee —_KKi sq? 
|k —k,| = [(k — k,)° + 2kk,(1 —cos 9)] k kit aK =) vo, 
where @ is the small angle between k and k,, we have 
B(«k) — Bik.) — Boek) = kk, 87/2(k — ky) > 0. (130.9) 


This inequality specifies the properties of the dispersion relation that are necessary 
for decay. 

For frequencies w <m, the dispersion relation is given by (130.5) and (130.6), so 
that B(k) ~ — k[n(«) — 1], where the function n(«) depends on the direction of the 
vector k but not on its magnitude. Then we must have 


kiny(«) + (k — k,)n2() — kn (x) > 0. (130.10) 
Since n, > nj), this condition immediately excludes the decays 


Yi>vyt Vp Ys> Wt Ya, 


where y denotes a photon, and | and || correspond to the two polarizations defined 
above.t 
For the decays 


Yi>x yet Yi, Yt yp 


+ Numerical calculations show that n, > nj; is true not only when w <m and (130.5) and (130.6) are 
valid, but for all wa <2m, the threshold for pair production by the photon. 
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the left-hand side of (130.10) is zero, since the functions n, n;, n. are the same. To 
solve the problem in this case, we have to take into account the dependence of the 
refractive index on k which appears as w increases. The required inequality is 


kin(«, k,) + (k — k,)n(«, k —k,)— kn(«, k) > 0. 


It can be shown by general arguments that n(«, k) is an increasing function of k, 
and so this inequality cannot be satisfied, so that the above decays also are 
impossible: replacing n(k —k,) and n(k,) by n(k) will certainly increase the sum, 
and the result of these changes is a sum equal to zero. This conclusion applies to 
any transparent media, and follows from Kramers and Kronig’s formula for the 
refractive index (see ECM, 864). In the present case, the external field is a 
‘transparent medium’ for photons of all frequencies w<2m, up to the pair 
production threshold, i.e. the photon absorption threshold. 
Thus the only decay processes allowed are 


Y=? Ya" Vas (130.11) 
YiI7 Wy Ya (130.12) 


It has already been noted that the momenta k; and k) are at small angles 0 to 
the initial photon momentum kK. If these angles are neglected, 1.e. if the momenta of 
all the photons are assumed to be parallel (the collinear approximation), then the 
decay (130.12) is impossible, as may be shown in the following way. 

Similarly to (127.4), we express the decay amplitude as 


My = Myve*et*e3*, 
where e, e€;, €2 are the photon polarization 4-vectors, defined as usual by their 
4-potentials A. With the three-dimensional potential gauge, e =(0,e), we can 
rewrite this as 
My = Mireve txe 3. 
Two independent polarizations are defined by the unit vectorst 


ek <b, e,|Ik x (k Xb). (130.13) 


It is easy to see that, in the expansion 
— A A A 
Miu = >» Maar e} ey Vel 2», 
A,Ay,A2 


where A, Aj, Az take the values 1 and || (cf. (127.9)), the vectors e, must occur an 


+ The suffixes || and 1 refer to the polarizations defined above. It must be remembered that the unit 
vectors e determine the direction of the vector potential A (and therefore of the field E’) and are 
perpendicular to B’. 
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even number of times (0 or 2) in each term. The amplitude M,;; is invariant under 
the CP transformation, and since the potentials A (and therefore e) are CP- 
invariant, so must be the tensor My . Under the CP transformation e;— e, e, > — e.; 
charge conjugation changes the sign of b, inversion changes that of k while leaving 
the axial vector b unaltered. Hence, if the vector e, occurs once in any term of the 
expansion, the corresponding scalar M,,,,, must be CP-odd. But it is impossible to 
construct a CP-odd scalar from only two vectors (in the collinear approximation) 
k =k, =k, and b, both of which change sign under the CP transformation. This 
proves the above statement. 

The decay (130.12) is therefore forbidden in the collinear approximation. A 
more detailed analysis shows that the ratio of the amplitude of this process to that 
of the decay (130.11) allowed in the collinear approximation, 


Mit _ 92 a (Bol B.,)’, (130.14) 
M1 


where 
B= m’/le| (= m’c*/lelh = 4.4 x 10°G); 


the angles 3 are estimated from (130.9) as 0° ~ n,— ny. 

The fact that the only possible decay (in the principal approximation) is 
Yi y+ y. implies that | polarization is eventually established in an unpolarized 
photon beam propagating in a magnetic field. 

Let us now calculate the decay amplitude M;; = M_.., by perturbation theory, 
assuming that By < B,,. 

The first non-vanishing Feynman diagrams (with respect to a, and with respect 
to the external field) are of the form 


Kee, ea nag cs da A 
| | (130.15) 
kK ——~ — —-<« — 


with all possible permutations of the ends, three ends corresponding to photons and 
one to the external field. In the collinear approximation, however, the amplitude 
corresponding to these diagrams is zero. For, as a result of gauge invariance, the 
external field can appear in the amplitude of the process only as a 4-tensor of its 
field strengths F,,, and the photon polarization 4-vectors only in the antisymmetric 
combinations 


fur = Kye, — Ke, 


with the wave 4-vectors. The final expression for the amplitude is constructed from 
the external field tensor F,,, the tensors f,., fi, and f2,, of the three photons and 
their wave 4-vectors k,, ki, k2,; it must be linear in each of the tensors f,,, and for 
the diagrams (130.15) it must be linear in F,, also. In the collinear approximation 
the 4-vectors k, and k, reduce to k: k; = kw;/w, k2 = kw./w. Under these conditions, 


QE4 - MM 
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any scalar product formed as stated above is identically zero: we can easily show 
that such a product will contain at least one zero factor k’ or ke. 

In the collinear approximation, therefore, the first non-zero contribution to the 
decay amplitude comes from hexagonal diagrams of the form 


ee alter 2S age ee 
oe <{ )- - (130.16) 


with three external-field lines.f The amplitude corresponding to such diagrams 
involves three factors F,,. Scalar products of this kind need not be zero, but all 
non-zero products contain the photon wave vectors only through the tensors f,,,: it 
is easy to see that the addition of further factors k will lead to the presence of zero 
factors k? or ke in the products. The components of the tensor f,, are the same as 
those of the photon fields E’ and B’. This means that, if the decay amplitude 
corresponding to the diagrams (130.16) is represented as the matrix element of an 
operator, then that operator, expressed in terms of the photon field operators, is 
independent of the photon frequencies. Hence it follows in turn that the calculation 
of the scattering amplitude corresponding to the diagram (130.16), by means of the 
Lagrangian (129.17), gives the correct answer without restriction to the case w <m. 

It has been shown at the end of 8127 how the interaction Hamiltonian is 
obtained from the Lagrangian L found in §129. We now have a process involving 
three photons, and the corresponding interaction operator is found from the terms 
in the expansion of L which contain products of three photon fields E’ and B’. Here 
we need consider only the term 


(B’ « Bo)(E’ + Bo)’, (130.17) 


in which each of the vectors B’ and E’ appears as a scalar product with Bo: the 
products E”, B” and E’ - B’ arise, in the four-dimensional notation, from scalars of 
the form f,.f*’, which in the collinear approximation are identically zero. The 
selection of the term containing one factor B’ and two factors E’ is made because 
the process under consideration involves one || photon and two 1 photons; for the 
former, the field B’ has a component along Bo, and for the latter the field E’ has such 
a component. 

The Lagrangian L is expressed in terms of the invariants ¥ = 3(B’—E’) and 
“4 =E-B. The required term in the expansion comes from a term proportional to 
#G*, A calculation using (129.17) gives for this term 


13e° 
“6300? e 


Putting B = By) + B’, E= E’, and taking the term By): B’ from & and By: E’ from @, 
we get the required expansion term having the form (130.17). Thus the three-photon 


+ The corrections arising from the inclusion of non-collinearity in the diagrams (130.15) would give a 
contribution to the amplitude that is of the next higher order in a relative to that from (130.16). 
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interaction operator for the decay yj— yi. + ya. is 


13e° 


(3) 
v 315a7m® 


[ By -Bp@B,- £3)B, - BY a°s, (130.18) 


where 
B= iV (4k X eye" "Ey, 


Ei =~ iV (47) wie, BE Oi 


and similarly for Ej; cf. (127.26), (127.27).t 
According to the rules given in 864, the decay amplitude M,;; is calculated from 
the definition 


Si = (f1[ V ati) = — 12m)86%(ke — ky — ke) Mp 
and is 


13e° 3/2 3033 
~§aysca a (47) ww)w2B 9 sin 0, 


M;; —= 
6 being the angle between k and Boy. The decay probability per unit time is (see 
(64.11)) 


dw = (27)*8(k — k; — k)6(w — w; — w2)|M (edhe 
irre I Do aay Lor On)” 

the extra factor 3 takes account of the decrease in the phase volume due to the 

identity of the two final photons. The first delta function is eliminated by in- 

tegration over d°k,. To eliminate the second we note that, if dispersion is neglected, 


@ — 0,—- @,.=k—k,—|k—k,| 


~ —_kki py _ 
a ain cos 01), 


and thereforet 


wo | 
| | Ww 1075(@ — @;— @2) d cos 91° 27H} dw, 
0 0 


@ 


SL | wi(w — @1)’'do; = 7o"/15. 
0 


+ The coefficient in (130.18) is doubled because Ej; and E} can be taken from either of the two factors 
E’ in L. 

+t Here it is assumed that, with dispersion taken into account, the argument of the delta function in 
fact has a zero at some cos 0; < 1. Thus a dispersion of this type is necessary for decay to be possible, 
but the decay probability itself does not depend on the amount of the small dispersion. 
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We finally have for the total photon decay probability per unit time (in ordinary 
units) 


as ( 13 ) me (te,) (Be sin a 


"159° \315) fh \me? B., 
2 Dee aD 3 9 5 
= «6 mc~ (Bo sin my (+) (5) 
ue h Ge. mc} \mc?) ° eel?) 


As already mentioned, the condition w <m is not necessary for this formula to 
be valid. Its validity is restricted only by the condition that the terms corresponding 
to eighth-order diagrams be small. To obtain an estimate, we note that the 
eighth-order matrix element may contain, for example, a term differing from the 
sixth-order terms by a dimensionless invariant factor of the form (eF“’k,/m°*). The 
condition for this to be small is a very weak one: 


w <m(m’*/\e|Bo). 


§ 131. Calculation of integrals over four-dimensional regions 


We shall now give some rules and formulae that are useful in the calculation of 
integrals arising in the theory of radiative corrections. 
The typical form of integral corresponding to a Feynman diagram is 


_f(k) d*k_ (131.1) 
A,;a,..- Qn 
where a, do,... are second-degree polynomials in the 4-vector k, f(k) is a 


polynomial of some degree n’, and the integration is over the whole of four- 
dimensional k-space. 

A convenient method for calculating such integrals, due to Feynman (1949), is 
based on an initial transformation or parametrization of the integrand by the use of 
further integrations with respect to auxiliary variables &,, &,...: 


a)\a2... (a1€) + Qo. +: “dpcay 


0 0 


This transformation replaces the n different quadratics in the denominator by the 
nth power of a single quadratic polynomial. 

Eliminating the delta function by integrating over d&, and using new variables 
defined by 


Ey = Xn-1,5 £, = Xn-2 —~ Xn=-1, +++ 5 En-1 = X1 — Xo, 


Cpt Gosh Sg = ys 
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we get as an equivalent form of (131.2) 


1 xy Xn-2 
] 
mara Df ds fdr fF davx 
0 0 0 


] 


ea 131.3 
[GiXn-1 a A(Xp-2 — Xn-1) See a, (1 — x1)]" ( ) 


When n = 2, this formula becomes 


_f_ a 
[a,x a ay(1 —_— x))”’ 
0 


(131.4) 


as can be easily verified. For any value of n, the formula can be proved by 
induction: on carrying out the integration over dx,-; in (131.3), we get on the 
right-hand side the difference of two (n — 2)-fold integrals of the same form. If the 
formula is assumed valid for these, we have 


1 1 1 | 
A;—@.|Q00;...Q@, @:d3...a, 1’ 


which is equal to the left-hand side of (131.3). 

By differentiating (131.3) with respect to a), a2, etc., we can derive similar 
formulae which can be used for the parametrization of integrals whose denomina- 
tors contain any of the polynomials in powers above the first. 

The divergent integrals are regularized by subtracting from them other integrals 
of similar form. To determine the difference, it may be convenient first to 
transform the difference of the integrands (each of which is already transformed by 
means of (131.2)) as follows: 


ee ees n(a — b) dz 
a” pb" ieessressau (131.5) 
0 


After the application of (131.3), the four-dimensional integration in (131.1) becomes 


f(k) d*k 
Reeser (131.6) 


—1Y-a’] ns 


where | is a 4-vector and a’ is a scalar, both of these depending on the parameters 
X;,.--,Xn-1; the scalar a’ will be assumed positive. 

If the integral (131.6) converges, we can make the change of variables k —1-k 
(a shift of the origin), which gives (with a different function f(k)) 


foe. (131.7) 
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the denominator now involving only the square k*. In the numerator we need only 
consider scalar functions f = F(k’): for integrals having numerators of any other 
form, 


k"F(k?) d*k 
“(ke-a)" 0, (131.8) 
k*k’F(k*) d*k d*k _ k?F(k*) d*k d*k 
| (k* — (k= a2)" ~~ 18” | (ke a2)" a’)" ’ (131.9) 
jeer *) d*k 
(k? aye 


*\? F (k’) d*k 


I len vo Co V, k 
= 54 (ee + gre + greg) [Ee (131.10) 


and so on, as is evident from symmetry on integration over all directions of k. 

In the original integral (131.1), each of the factors a), ao,... in the denominator 
has (as a function of ko) two zeros, which are avoided in the integration over dky 
according to the general rule (§75). After the transformation to the form (131.7), the 
2n simple poles of the integrand are replaced by two poles of order n, which are 
avoided according to the same rule (path C in Fig. 25). By moving the contour of 
integration as shown by the arrows, it can be converted to the imaginary axis in the 
ky plane (path C’). Thus the variable ky is replaced by iko with ko real. Then, writing 
also k' for k, we have 


ke = kp—- ko (ke +k?) = —k”, (131.11) 
where k’ is a 4-vector in the Euclidean metric; and 
d*k > id*k' = ik’ dk”) dQ, 


where dQ) is an element of four-dimensional solid angle. The integration over dQ 
gives 27’ (see Fields, §111), and so 


d*k > in*k’d(k”). (131.12) 


FIG. 25. 
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Putting k” = z, we have finally 


F(k’) d*k rF — d 
eye ) Bene 1)"in? oe. (131.13) 
0 
In particular, 
d*k (—1)"ir’ 
lew _ a’)" ay che (n i 1)(n = 2) (131.14) 


The logarithmically divergent part of the integrals (131.7) can be separated as 


laces (131.15) 


It is easily seen that in an integral of this type we can replace k by k+l: the 
difference is 


\ecy ere aryl 


and is a convergent integral, so that the change k > k +1 is certainly permissible in 
this integral. On doing so and also changing the sign of k, we get the same quantity 
with the opposite sign, and it must therefore be zero. 

A linearly divergent integral would have to have the form 


(att ma (131.16) 


but in fact such an integral is only logarithmically divergent: the integrand tends 
asymptotically to k"/(k’)’ as k >» and gives zero on averaging over directions. The 


change of origin, however, adds a constant to the integral (131.16). This can be 
shown for the infinitesimal change k > k + 6l, by calculating the difference 


w= | graprat a al 


As far as the first order in dl, 
Ait 4k"(k dl) __—l* 
(k?-—a’y a’) (k?— Coy" 


In the first term, averaging over directions replaces the numerator by k’8l" (cf. 
(131.9)), and hencet 


a (a ae ee ee 
=a’ dl (kee—-ay 2 él’. (131.18) 


+ The corresponding result for finite | can be obtained by more laborious calculations. 


596 Radiative Corrections $13] 


In the final expressions for the radiative corrections there is frequently a 
transcendental function defined by the integral 


g 
F(é)= { 280 =*) as, (131.19) 
0 


sometimes called Spence’s function. Some of its properties (given here for 
reference) are 


F(é)+ F(/E) = ga’ +3 log’ é, (131.20) 
F(— &)+ F(-1+ €) = 6m’ + log € log (1 — 8), (131.21) 
FQ)=p7’, F(-1)=—én’. (131.22) 


The expansion for small & is 


F(é) = €-42 +98 —ief* + +. (131.23) 


CHAPTER XIII 


ASYMPTOTIC FORMULAE OF QUANTUM 
ELECTRODYNAMICS 


§ 132. Asymptotic form of the photon propagator for large momenta 


THE first-order term (with respect to a) in the expansion of the polarization 
operator P(k’) has been calculated in §113, and it was found that for |k?| > m7? this 
term is, with logarithmic accuracy, 


2 
P (k?) = = k? log Ky (132.1) 


It was also mentioned that the derivation of this formula (as a first-approximation 
correction to the propagator 47D ' = k*) assumed the condition 


2 


which placed an upper limit on the permissible values of |k’|. We shall now show 
that the expression (132.1) is in fact valid also with the much weaker condition 


© jog lk 
logy < 1. (132.3) 


The proof is as follows.t We first note that, although in principle the condition 
(132.3) allows contributions to #(k’) from the terms of all orders (in a) in the 
perturbation-theory series, in every order n only the terms ~a” log"(|k|’/m’) need 
be considered, which contain the large logarithm in the same power as a; the terms 
containing lower powers of the logarithm are certainly small, since a <1. 

The analysis of the perturbation-theory series for can be reduced to that of 
the series and I, using Dyson’s equation 


4 
P(k?) =i tr | y.9(p + k)U*(p +k, p; kK)G(p) se : (132.4) 


see (107.4). Since the function P(k’) is gauge-invariant, it can be calculated with 
any gauge for and I. The most convenient here is the Landau gauge, in which the 


+ This formulation and conclusions are due to L. D. Landau, A. A. Abrikosov and I. M. Khalatnikov 
(1954). 
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free-photon propagator has the form (76.11): 


4 kk, 
D,.Ak) = 73 (gu a su (132.5) 


(D” = 0 in (103.17)). It is found that in this gauge the perturbation-theory series for 
@ and I’ do not contain any terms with zero power of the logarithms. In (132.4) it 
is therefore sufficient to substitute the zero-order approximations = G,I"* = y", 
obtaining 

TO d*p 


D(k2) =i - ATO ty wG(p + Ky"GO) Go (132.6) 


This is the Feynman integral corresponding to the diagram (113.1) of the first 
approximation (with respect to a), and leads (after appropriate renormalization) to 
(132.1). 

In order to prove the foregoing statements, let us first examine the origin of the 
logarithm in the integral (132.6). It is easily seen that the logarithmic term comes 
from the range of integration 


p’>>|k?| when |k*|>m?*: (132.7) 
formally expanding G in powers of 1/yp, we, have 


G(p)~ 1/yp = yplp’, 
G(p — k) ~ 1/(yp — yk) 


I ee ee 
Oe are ae, ee eee, 
yp yp. yp yp’ yp yp 


_ vw? , OPP) 
P| OU 
i @/9102.9107210,910/:2) 


(py 


On substitution in (132.6) the first term, which is independent of k, is removed by 
regularization (in accordance with the condition ?/k*>0 when k*—0); the second 
term gives zero on integration over the directions of p; the third integral is 
logarithmically divergent with respect to p’, and on taking it from p’~ |k’| (the 
lower limit of the range (132.7)) to some “cut-off parameter” A’ we get 

a 2 


A 
2 
3a k* log 


ky (132.8) 
In regularizing, we have to subtract from #/k’ its value for k?=0. But, since 
logarithmic accuracy presupposes the condition |k’|>m/’, in calculation to this 
accuracy the regularization is achieved by subtracting the value for |k’|~ m’, and 
A’ in the argument of the logarithm is replaced by m’, giving (132.1). 
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Since the desired corrections to and I are logarithmic, their inclusion makes 
these quantities differ from G and y" by slowly varying logarithmic factors. In the 
exact integral (132.4), therefore, the important range will be the same one (132.7) as 
in the approximate integral (132.6). However, we cannot simply put k =0 in 
I“(p +k, p;k): since the integral is quadratically divergent, its regularization in- 
volves also the next two terms in the expansion of I[’(p + k, p; k) in powers of k. 
Here we shall consider only the corrections to I'"(p, p; 0), which show sufficiently 
clearly the importance of the choice of gauge and the difference in the nature of the 
integrals arising from diagrams of different types. It may also be noted that a 
similar analysis of Y is not needed, since the corrections fo I and & are related by 
Ward’s identity (108.8). 

The first-order correction (with respect to a) to ['(p, p;0) corresponds to the 
diagram 


ko 
P, Py 
p PA p 
and hence to the integralt 
Pa Sid | y*G(pi)y"G (pi) y’Dy.(p — pid*pi lay. (132.9) 


In the ordinary gauge, 


Dy. (Pp — P1) = @v ° 4al(p — Pv’, 


and the important range in the integral is p?>p’, where it is logarithmically 
divergent. On calculating the integral 


N a 4 
Pe = — Arai i OP Yn (132.10) 


and regularizing the logarithm, we get 


2 


7 6) aaa p 
r rai: log me 
In the Landau gauge, (132.10) is replaced by 
TO = —4rai i {y*(ypiy" (yp va — Piy"} d*pil(pi) Aa). 


+ To avoid misunderstandings in a comparison with the results of 8117, we may note that in §117 the 
two electron ends of the diagram were assumed to be physical, whereas here we assume that 
p’ > |k’|> m’, and these two lines are therefore certainly non-physical. 
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Averaging over the directions of p, and reducing the matrices y, we find that this 
integral is zero and the logarithmic term in I“? disappears.t 
In the second-order corrections (with respect to a) we take the diagram 


The corresponding integral is 


PX = —a? | y'G(p2)y’G(pdy"G(py"G(P2)y" x 
x D,,.(P2— p1)Dyo(p — pr)d*pid*p2/(27)’. 


In the ordinary gauge of the D functions, this integral includes a term in which the 
logarithm ts squared, arising from the range of integration 


pi>pi>p?: (132.11) 


when p2 is neglected in the argument of D,,(p.— pj), the integration over d‘p, 
becomes the same as in (132.9) and gives log p3, and the subsequent integration 
over d‘p, is likewise logarithmic, giving log’(p3/m’). When the Landau gauge is 
used for the D functions, however, the logarithmic terms disappear from both 
integrations. 

A similar situation occurs for all the other diagrams in the skeleton diagram 


(132.12) 


Diagrams of other types, with intersecting photon lines, for example those in the 
skeleton diagram 


ony (132.13) 


+ The corrections to G”' in both gauges, as derived from the correction ! by means of the identity 
(108.8), are of course in agreement with the results of §119. 
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(cf. (106.11)), do not in any gauge contain terms involving the necessary power of 
the logarithm; there is no range of the variables in which the integral reduces to 
several successive logarithmic integrations. 

These arguments, and similar ones for the subsequent terms in the expansion of 
IT in powers of k, confirm that in the Landau gauge there are no corrections to G 
and I’ involving the necessary powers of the logarithm; thus the expression (132.1) 
is in fact valid even if the condition (132.3) applies. 

The function 9(k*) which corresponds to the polarization operator (132.1) is 


An 1 


2k) = 32 [= (alm) loge 


(132.14) 
Because of the condition (132.3), there is no need to expand this expression in 
powers of a. 
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The applicability of (132.14) is, however, limited on the side of large |k’|, 
because the denominator decreases. The derivation of that formula was based on 
neglecting the diagram (132.13), and others with even more intersecting thick photon 
lines, in comparison with (132.12). But each such added line brings into the diagram a 
factor e?@, with D the exact propagator. The small parameter is then not a = e’ but 


a 
1—(al3m) log k4/m? oak? * I (13321) 


When |k’| increases and this quantity becomes comparable with unity, the small 
parameter essentially disappears from the theory. 

The resulting situation can be more clearly understood if the renormalization in 
the derivation of (132.14) is made not ‘‘en route’? but through the use of an 
“intrinsic” electron charge e,, which will here be chosen so as to give the correct 
value of the observed physical charge e ($110). If the integral 1s ‘‘cut off’, as was 
done previously, at some upper limit A’, the intrinsic charge is a function e,(A’) and 
the limit A->© must finally be taken. 

With this treatment, the polarization operator is 


Eritigstl 
P(k?) = 3% K log ia 


(the expression (132.8) with e. in place of e), and therefore 


a ere 
Dk) = 75 Tem) log Ae)’ (133.2) 


Now determining the physical charge e from the condition 


e2D(k?) > 42e7/k*> when k?> ~m’, 
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we have 


2 
Di Bote 5 Sh es 
e = Taneua Ry ORCI’ (299) 


or 


Z 


7 e 
~ 1 —(e7/3 7) log(A2/m’) eoH) 


e2 


If we formally take the limit A> © in (133.3), then e’—0 whatever the form of 
the function e(A). This “zeroizing’” of the charge means, of course, that no 
rigorous renormalization is possible. The limit cannot be taken, however, without 
violating the assumptions made in deriving (133.3). It is seen from (133.4) that, as A 
increases (for a given value of e’), e2 also increases; and the formulae cease to be 
valid even when e2 ~ 1, since their derivation is based on the assumption that 


<1 (133.5) 


as the condition for perturbation theory to be applicable to the ‘“‘intrinsic’”’ 
interaction. The failure of the inequality (133.5) to be satisfied as A increases is of 
fundamental significance. It shows that quantum electrodynamics is logically in- 
complete as a theory that is based on weak interaction. This essentially implies that the 
existing theory as a whole ts logically incomplete, since its entire formalism is based on 
the possibility of treating the electromagnetic interaction as a weak perturbation. All 
the quantities calculated by the theory are found as series in powers of e2, which are in 
fact asymptotic series. In order for them to have a definite meaning when e2 is not 
small, further arguments would be needed which do not follow from the general 
principles of the existing theory. 

It must also be emphasized, however, that in quantum electrodynamics these 
difficulties cannot be more than theoretical ones. They arise at enormous energies 
that are of no practical significance.t We may expect that in reality the elec- 
tromagnetic interactions will very much sooner be ‘“‘merged”’ with the weak and 
strong interactions, so that pure electrodynamics is no longer meaningful.¢ 

To conclude this section, we shall show how formulae (133.3) and (133.4) can be 
derived by simple arguments based on the significance of renormalization and on 
dimensional reasoning (M. Gell-Mann and F. E. Low, 1954). 

Let us consider the square of the unrenormalized charge as a function of the 
cut-off parameter, e2(A’), and define a function d as the ratio of the values of e2 for 
two different arguments: e2(A3) = e2(Aj)d. When Aj, A3>m, the function d is 
independent of m and, being dimensionless, can depend only on the likewise 


+ For example, the equation (a/7) log(e’/m’) = 1 is satisfied when ¢ ~ 107m. 

+ The opposite situation occurs in theories where the interaction between particles is mediated not 
by the electromagnetic field but by Yang—Mills fields. The relation between the renormalized and 
unrenormalized charges in such theories is given by an expression of the type (133.4), but with the 
opposite sign in the denominator, so that, for a given value of e’, the unrenormalized charge e2 
decreases with increasing A. This is called asymptotic freedom of the theory. Such a theory is, of 
course, fundamentally different from the theory with zeroized charge. 
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dimensionless quantities e2(Aj) and A3/At: 
e(A3) = e(Ai) d[ec(Ai), A3/Aj]. (133.6) 


From this functional relation, we can derive a differential equation, by writing it 
for infinitesimally close values of Aj and A3. Denoting Aj by é and putting 
A} = &+ dé, we obtain for a,(€) = e2(Aj) the differential equation 


da, = 6(a,)dé/€, (133.7) 
with 
(ac) = a[dd(ac, X)/OX]x=1; (133.8) 


we have used the fact that d(a., 1)=1, from the definition (133.6). Integration of 
(133.7) from € = Aj to € = A} gives 


e2(A3) 
log( A2/A2) = | dei). (133.9) 


e2(A?) 


Throughout the range of integration, e2 is small. We can therefore use for $(a) 
the expression corresponding to the first approximation of perturbation theory. The 
correction to the unrenormalized charge e2 is e2k*P(k’). Taking the first ap- 
proximation (132.1) for the polarization operator, we find 


d (a, A3/A4) = 1+ (a/37) log(A3/Aj), b(ac) = ae/3-7, 
and the integration in (133.9) then gives 


ee ee en on 
a BAT OHA CMA ory 
As Aj>~m7’, the unrenormalized charge e.(Aj) tends to the actual charge e, and 
(133.10) then agrees with (133.3) and (133.4).t 


§ 134. Asymptotic form of the scattering amplitudes at 
high energies 


Let us consider the asymptotic form (at high energies) of the amplitudes and 
cross-sections for two-particle scattering processes (1+2-—>3+4). For the basic 
electrodynamic processes in the first non-vanishing approximation (with respect to 
a), this problem can be solved by means of the specific formulae derived in the 
preceding chapters, which are valid at all energies. Here, however, we shall discuss 


+ The renormalization group method based on the functional properties of propagators and vertex 
parts is systematically developed in the book by Bogoliubov and Shirkov cited at the end of §112. 
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the question from a more general standpoint, which enables such asymptotic forms 
to be derived directly. 
As in §66, we use the invariants 


S =(p,+ pr)’, t = (pi— ps)’, u = (pi Da)’, (134.1) 


with p; + p2 = p3+ p4; the notation corresponds to reactions in the s channel, which 
will be considered here. In the ultra-relativistic case, when the energies are much 
greater than the particle masses, the energies of the two particles in the centre-of- 
mass system are approximately equal. We denote by « the sum of the energies of 
the colliding particles, and then have, in the centre-of-mass system, p\ = Ge, Pi), 
P2 = (€, —pi), P3 = Ge, ps), Pa = Ge, —ps), pt = ps =4e’, with 


s =e’, t=—4s(1—cos 6), u = —38(1+ cos 8), (134.2) 


6 being the angle between p, and ps3. 

Let us first consider the asymptotic form of the reaction cross-section for a 
fixed value of the scattering angle @. Then all three variables s, t and u are 
proportional, and tend to infinity together. In the ultra-relativistic case, the particle 
masses cannot appear in the result, and the only quantity having the dimensions of 
length is l/e (=hc/e). Hence it follows from dimensional arguments that the 
differential cross-section for two-particle reactions decreases with increasing 
energy in the asymptotic form 


do/dox<1/s as s,|t|,|u|>~. (134.3) 


If the cross-section is related not to the solid-angle element do but to the 
differential dt, we have, since do « dt/s, 


do|dt « 1/s’. (134.4) 


The cross-section is expressed in terms of the scattering amplitude (in the ultra- 
relativistic case) as do/do « |Myi|’/s; see (64.22), (64.23). The law (134.3) therefore 
means that in the asymptotic limit the scattering amplitude is independent of s: 


M,; = constant. (134.5) 


It is clear from the manner of the derivation that these results apply not only to 
the first non-vanishing approximation of perturbation theory, but also to higher 
approximations (those taking account of radiative corrections), if logarithmic 
factors (of the form log s/m’) are ignored; the dependence on dimensionless 
logarithms cannot, of course, be determined from dimensional arguments.t 

A different situation arises if s increases with t fixed, i.e. with a fixed square of 
the momentum transfer. This is scattering through small angles, which decrease as 


+ The summation of series containing logarithmic corrections may lead to an exponential depen- 
dence on the logarithms, which changes the exponent in the power law. This change is, however, small if 
a is small. 
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the energy increases: 
s >, |t|~ sé? = constant, 6 ~ (\t\/s). (134.6) 


In such a case, dimensional arguments allow us to establish only that the combined 
power of 1/s and 1/t in do/dt is 2, and in the amplitude M;; zero.t Hence, to find 
the part of the cross-section that decreases least rapidly with increasing s, we have to 
separate the factor that has the greatest power of 1/t. Such factors arise only if the 
Feynman diagram can be divided into two parts between the ends 1,3 and 2,4 by 
cutting the lines of virtual particles. The total 4-momentum of such lines is p;— ps, 
and this leads to the factor that depends on t = (p; — p3)’. Thus the asymptotic form 
of the diagram in the range (134.6) depends on the nature of the possible cuttings of 
diagrams in the t channel. 
Similarly, the asymptotic behaviour in the range 


so, |u|~s(a—6)'=constant, |a— 6|~ (ul/s), (134.7) 


corresponding to scattering through angles close to 7, is governed by the nature of 
the possible cuttings of diagrams in the u channel, i.e. between the ends 1, 4 and 2, 3. 

The simplest example is electron—electron scattering, described by the two 
diagrams (73.13) and (73.14). The first of these allows cutting in the t channel at the 
virtual photon line, and it determines the asymptotic form of the scattering amplitude 
in the range (134.6). The virtual photon line corresponds to a D function that is 
proportional to 1/t. The asymptotic forms of the amplitude and the differential 
cross-section are 


My & s/t, do & dt/t’. (134.8) 


In the limit (134.7), close to the backward direction, the asymptotic form is deter- 
mined by the ‘“‘exchange”’ diagram (73.14); in that limit, 


M; « s/u, do « dulu’. 


For mutual scattering of different particles (electron and muon), there is no 
exchange diagram, and so the cross-section for scattering through angles @ ~ 7 
decreases in accordance with (134.3) and (134.4).4 

We shall show that these results for the asymptotic behaviour of electron— 
electron scattering are unaffected by the inclusion of radiative corrections. To do 
so, let us consider the corrections of various orders to the diagram (73.13). 

It has already been shown that the diagrams which are corrections to the 
internal D function (see (113.11)) or to the vertex parts (see (117.1)) lead only to 
logarithmic corrections in the amplitude; they do not alter the power law (134.8). 
We shall show that the same is true of the diagram allowing cutting at two (not one) 


+ These arguments assume a constant |t| > m’. The results thus obtained remain valid, as regards the 
dependence on s (i.e. on the energy), even when |t| ~ m’. 

+t All these statements are, of course, in accordance with the results of §81; see (81.11) and Problem 
6. 


QE4 - NN 
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internal photon lines: 


p,-P,-d A Ag (134.9) 


The scattering amplitude corresponding to this diagram differs from that cor- 
responding to (73.13), in that the factor 1/t is replaced by 


(Pit Wy@2- 4) gig 
(pit q)y(p2— q)'q"(ps—Pi- gq)” 


followed by integration over d*q. The important range of integration is the one 
which gives rise to the lowest power of 1/s. For this, g must always be small in 
comparison with p,; and p2. Rejecting the terms which are then small (and also the 
terms pj = p}=™m_’), we can rewrite this expression as 


en © 7/2110 7/7) aa d*q 
(piq)(p2q)q'(p3—Ppi-qy 


The denominator does not contain s if qo and q, (with the x-axis along p; = —p) are 

x i/Vs; dy and q, may be « V\t\; then the range of integration « 1/s. The order of 
magnitude of the numerator is pip.2%«s. Thus the replacement of one internal 
photon line in the diagram by two does not affect the dependence of the diagram on 
s (for a given t).t That is, the contribution of the diagram (134.9) to the scattering 
amplitude has the same asymptotic behaviour (134.8) as that of the principal 
diagram. The position is unaffected by adding other parallel internal photon lines in 
the diagram, and also by including corrections to the internal electron lines. 

This is a general result: any diagram which can be cut in the t or u channel into 
two parts across any number of internal photon lines corresponds to an amplitude 
contribution which has the asymptotic form M,;; « s/t with t constant or s/u with u 
constant (V. G. Gorshkov, V. N. Gribov, L. N. Lipatov and G. V. Frolov, 1967; H. 
Cheng and T. T. Wu, 1969). 

As a second example, let us consider Compton scattering described by the two 
diagrams (74.12). These do not allow cutting in the t channel, but the second 
diagram can be cut in the u channel at an internal electron line. In the notation of 
the present section, it 1s 


2 ee ee eee) 


fP.-°, (134.10) 


prt, ee 


This means that the scattering is largely concentrated near the backward direction, as 


+ Let us mention again that only power-law asymptotic forms are under consideration, and so we 
need not take account of logarithmic divergences in the integrations. Diagrams of the form (134.9) will 
be further studied in §137. 
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already noted at the end of 886; see (86.20). To find the asymptotic behaviour in 
this range, we note that the factor G corresponding to the internal line in (134.10) is 
in order of magnitude 1/y(p;— ps) « 1/V|u|. Hence the scattering amplitude My ~ 
a(s/|u|)?; this includes a factor a because the diagram (134.10) is of the second 
order. The differential cross-section is therefore dao/du « a*/|u|s. The integral of this 
expression with respect to |u| is governed by the range |u|<s. The total cross- 
section then decreases with increasing energy as 0 « a7/s, or more exactly o « 
(a’/s) log(s/m7); cf. (86.20).+ 

For this process, however, radiative corrections alter the asymptotic behaviour. 
The change is due to sixth-order diagrams of the type 
— ee e 


i 3 | | 


— (134.11) 


In the t channel, these allow a cut across two internal photon lines, and therefore 
contribute to the amplitude, with asymptotic form My, %a’s/t; the factor a° 
corresponds to a sixth-order diagram. When s is sufficiently large, this part of the 
amplitude becomes the principal one, and the differential cross-section is 
then 


do/dt « a°/t?. 


The integral of this expression with respect to t is governed by the range of small 
|t}~ m’, i.e. by the range of scattering angles 6 ~ mVs; note that the scattering 1s 
now mainly forward, not backward. The total cross-section then no longer 
decreases with increasing energy: 


a <a°/m’?= a'r. (134.12) 


The decreasing part of the cross-section becomes comparable with this constant 
part when ¢ = Vs « m/a’. 

A similar situation occurs for photon—photon scattering. In the first non- 
vanishing approximation, this is described by the “‘square’”’ diagrams (127.1), which 
can be cut across two internal electron lines. Integration is carried out with respect 
to the 4-momentum of these lines in the diagram; momenta ~Vs are important, 
and small values of t or u are not especially significant. The asymptotic form of 
these diagrams for any constant t or u is given by (134.5): My, = constant « a’. The 
total cross-section decreases with increasing energy: o « a*/s (cf. (127.23)); angles 
close to zero or 7 have no special significance here. In the eighth order, however, 
there are diagrams which can be cut (in the t or u channel) across two internal 


+ The exact form of the dependence of the cross-section on |u| or |t| when these are <m’ cannot, of 
course, be ascertained from the arguments given here. It is assumed that the integral with respect to |u| 
or |t} converges at values ~ m’. This is in fact so for all processes except elastic scattering of charged 
particles. 


608 Asymptotic Formulae of Quantum Electrodynamics § 135 
photon lines, for example 


I l 


i (134.13) 
oe) tel ee 


“<--> 


These diagrams give an asymptotically constant cross-section: 0 « a°/m* when 
Vs> ml a?.t 

The asymptotic constancy of the total cross-section is a characteristic property 
of scattering processes whose diagrams can be cut (in the t or u channel) across 
internal photon lines. It occurs even when more than two particles are present in 
the final state of the reaction. 


§135. Separation of the double-logarithmic terms in the 
vertex operator 


The corrections having the form (aL)" (where L is the large logarithm) can 
become important only at enormous energies, as already mentioned at the end of 
§133, and therefore are of purely theoretical significance. But there are also much 
larger corrections, of the form (aL’)", in the amplitudes of actual scattering 
processes. Such terms, containing the square of the logarithm to the same power as a, 
are called double-logarithmic terms. 

The characteristic expansion parameter in the double-logarithmic corrections 1s 


(a/7) log’(e?/m’), (135.1) 


where ¢« denotes the energies occurring in the problem (for example, the total 
energy of the colliding particles in the centre-of-mass system). The condition for 
perturbation theory to be valid is that this quantity be small; it ceases to be satisfied 
at energies 


e ~m exp]xV(n/a)] ~ 3 X 104m. (135.2) 
Let us now try to avoid this limitation and derive formulae valid when 
(a/ 7) log’(e?/m’) S 1. (135.3) 


This clearly requires the summation of an infinite series of corrections in all powers 
(aL’)". 

The double-logarithmic corrections occur in cases of two kinds. One in- 
cludes scattering through a fixed finite angle; as has been shown in §134, the 


+ The cross-section for coherent scattering of a photon in the field of a nucleus is asymptotically 
constant even in the first non-vanishing approximation, described by “‘square” diagrams in which two 
ends are external-field lines; see (128.7). In reality, however, these diagrams would have to be 
represented in the form (134.11), the upper continuous line being the nucleus line. The external-field 
lines then become internal lines in the diagram, and the reason for the asymptotically constant form 
becomes evident. 
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cross-sections always decrease in the asymptotic high-energy range. In such cases, 
the double-logarithmic corrections are closely associated with the infra-red diver- 
gence. These cases include, in particular, elastic scattering of electrons in an 
external Coulomb field; the first double-logarithmic correction to the cross-section 
has been found in §122. The present section and $136 deal with the complete 
determination of these corrections, under the condition (135.3). 

The other class of cases includes reaction cross-sections which decrease with 
increasing energy for a given square of the momentum transfer, i.e. for scattering 
angles which asymptotically approach zero or 77; as shown in §134, this occurs for 
processes whose diagrams cannot be cut across internal photon lines in the t or u 
channel. Here, the double-logarithmic corrections are not connected with the 
infra-red divergence. As an example of this, we shall discuss in §137 the problem of 
backward (u =constant) electron—muon scattering. 

First of all, with the condition (135.3) the single-logarithmic corrections are 
~(al 7) log(e?/m’) Ss V(a/7) <1 and can therefore be omitted. Since the double- 
logarithmic corrections do not appear in G and &, the latter functions can now be 
taken as simply equal to their unperturbed values G and D. 

The calculation of the vertex operator [ involves the summation of the 
double-logarithmic terms which arise from an infinite sequence of diagrams. This 
problem will be analysed in §136, but first a method will be described for separating 
the double-logarithmic terms in the various Feynman integrals before actually 
performing the integrations over all the variables in them (V. V. Sudakov, 1956). 

Let us consider the first-order correction (with respect to a) to the vertex 
operator, represented by the diagram (117.1), which is here conveniently taken 
(with a renaming of the variables) in the form 


(135.4) 
or, analytically, 
OG. pegs _ie* y’ (yp2— vf + m)y" (yp. — yf + m)y,d*f 
ai 4m° J [((po— fy’ — m’ + i0][(p1—f)°— m? + iO][f* + i0y 
(135.5) 
We shall assume that 
\q>|> pi, p3, m’, (135.6) 


and that the ends p;, p2 may be eithet physical or virtual. From (135.6) it follows 
that 


[pip] ~ 21471 > pi, p3, m’, (135.7) 
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i.e. the 4-vectors p,; and p2 have large components but small squares; this is 
possible because the four-dimensional metric is pseudo-Euclidean. The double- 
logarithmic terms in fact occur when the conditions (135.6) are satisfied. 

We shall see below that relatively small values of f are important in the 
integration over d*f. We can therefore neglect f in the numerator of the integrand, 
and I” becomes 


ie” 


pe) = are y’(yp2tm) y"(ypi+m) yylh, (135.8) 
where 
_ d' 
b= | Spm ON SOS ce 


The matrix factor in (135.8) can be simplified by using the fact that when [ 
appears in diagrams it is always, in effect, multiplied by the matrices (yp.+m) and 


(ypit+m): 


(yp2+ m)T(ypi +m). (135.10) 


For, if the lines p; and p> are virtual, these factors come from G(p,) and G(p2); if 
the lines correspond to real electrons, I’ is multiplied by # and u,, and Dirac’s 
equations show that 


Interchanging the order of the matrix factors and neglecting at each stage, in 
accordance with (135.7), the squares p{, p> and m’ in comparison with p p2, we get 


l 


- 2 
e 
(yp2t+ m)P*(yp, + m) = 6 (pip2)(yp2+ m)y*(ypit m)h. 


We can therefore put I” in the final form 


[* = (ie?/27°) y"th,, (135.11) 
where 


t= q> =~ —2(pip»). (135.12) 


The integral I, converges when f is large, and therefore need not be regularized. 
The main point of the subsequent calculations is the introduction of new and 
more convenient variables of integration. Let f be resolved into components 
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tangential and normal to the p;p2 plane: 


f= up, + vp2t fi 
=fit fi, 


fiP: =fip2=9. 


As new variables, we take the coefficients u and v and 


p =—fi. 


611 


(135.13) 


(135.14) 


(135.15) 


It is evident from the conditions (135.7) that the metric in the pp. plane is 
pseudo-Euclidean. The time axis can therefore be taken in this plane, so that f, isa 


space-like 4-vector and p > 0. 


Let the indices 0 and x temporarily denote the components of 4-vectors in the 
Pip2 plane; y and z those in the normal plane. To transform the 4-volume element 


d*f = d’f.d’f 
to the new variables, we write 


d*f, = |f.| d|t.| db 
=;dpdd—> x dp 


(since the integrand in (135.9) is independent of the angle @). Also 


es O(fo, fx) 
n= Feet 
= |DioP2x — P20Pix| du dv 

~ 4\q°| du dv, 


du dv 


since p3 is small, so that p3, ~ p3) and 
(DP 10P2x — PrP ix) = (Di P20 — PoxP ix)” 
= (pipr)’ = Gq’). 
Thus 


d*f =4|t| du dv d’f, 
> 4q|t| du dv dp. 


(135.16) 


The calculations now depend on the relation between pi, p3 and m’; two cases 


will be considered. 
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VIRTUAL ELECTRON LINES 
Let the momenta p, and p2 correspond to virtual electrons, with 


[pil, [p3| > m?. (135.17) 


We shall see that the most important range of integration, which leads to a 
double-logarithmic expression, is in this case given by the inequalities 


0<p <|tul, |tv], 
[pi/t| <|v| <1, |p3/t|<|u| <1. (135.18) 


Accordingly, in the denominator of the integrand in (135.9) we can neglect m’, p7, 
p3 and f* in comparison with (p,f) and (ppf): 


4 
I, = NOS TESIES TEE (135.19) 
The quantities pif, pof and f? are given by 
f* = (up: + vp2)’— p ~ — tuv — p, 

2(pif) = 2p\(up; + vp2)~ — tv, 

2(pof) = — tu. 
Then 

I= Sees er ae (135.20) 


In accordance with the conditions (135.18), the integration over dp is taken from 
0 to the smaller of |tv| and |tu|; the result is 


min (|tu|, |tv]) 
dp 


1 1 i when tuv <0, 
ee a mip {‘o (135.21) 


Sune min{ E 0 when tuv > 0. 


The logarithmic integration over dv is taken from —1 to —|p{/t| and from |pj/t| to 1 
(and similarly over du). When (135.21) is substituted in (135.20), the integral of the 
first term over dudv is zero, because the integrand is an odd function. The 
integration of the second term is carried over ranges of u and v having the same 
sign when t <0 and opposite signs when t > 0. In either case the ranges v > 0 and 
v <Q give the same contribution after integration over du, and the result is 


1 1 
im? du dv _ im 
[oy | ar | a Fe 
/t| 


[p [p4/t| 


log 


t t 
2 = (135.22) 


the sign being the same as that of t. 
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Finally, substituting in (135.11), we get 


q’ 


P*(p», pis q) = —a— y" log 4, log | (135.23) 
la’|> cs |p3| > m°. 
PHYSICAL EXTERNAL ELECTRON LINES 
Now let the momenta p, and p> correspond to real electrons, so that 
pt=pi=m’. (135.24) 
Then the important range of integration is 
0<p <|tul, |tv], 

0<|v|, |u| <1. (135.25) 


Since pj— m’= p3— m’=0, we can neglect pj and p3 in comparison with pif and 
p2f, and again bring the integral (135.9) to the form (135.19). To eliminate the 
infra-red divergence which then occurs, however, we must apply a finite photon 
mass A <m in the photon propagator (cf. $117): 


d“f 
t= } 20pf)- 2epaf) G2 A? + 10) e229) 


In this case 
f°’ =~ —tuv—p, 
2pif ~ —tv+2m°’u, 
2pof ~ —tut+2m’r, 
and hence 


2p. dp du dv 
a | ee eae (ae) 
where 7 = 2m/7/t <1. 

After the integration over dp, similarly to (135.21), we obtain 


I,=- 


us u—-—Tvv-TU’ 


the integration being subject to the condition tuv +A*<0. The ranges v >0 and 
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v <Q again make equal contributions, and the result of the integration over du is 


+ fa f eet 


6/ v 


1 

_ in’ 75 — v° dv 

tt [ toe (6—Tv’\(tT-—v) v’ e242) 
0 


where 5 = A’t, |5|<|z|, and we have used the inequality |z| <1. 

In the integral (135.28), three ranges of v lead to double-logarithmic 
expressions: (I) |r|<v <1, (II) V(8/t)<v <|r|, (II) V (78) Kv « V (8/7). (We 
take the specific case V (6/7) <|z|; the result does not depend on this assumption.) 
With the appropriate approximations in each range, we find 


_ im? 2 |t] {tl m 
I, = ¥ (log mh+4log “biog ™), (135.29) 


Finally, substituting in (135.11), we have 


2 2 
u(1) ; Zi, on) 214 q 7) 
Pps, psa) = ~Zey" (tog 4+ 410g Hiog™), (135.30) 
lq’|>pi=pi=m’, 


in agreement with (117.21). 


§ 136. Double-logarithmic asymptotic form of the vertex 
operator 


When the corrections I calculated in §135 become of the order of unity, the 
vertex operator has to be found by summation of the infinite sequence of double- 
logarithmic terms of all orders in a. This problem can be solved because such terms 
arise only from diagrams of a particular type, and the contributions from diagrams 
of different orders are related in a simple manner. As we shall see below, the 
double-logarithmic terms arise from all the diagrams that have the form 


(136.1) 
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etc., in which each photon line joins the two electron lines, and the photon lines 
themselves may intersect in any manner. 

Let the photon momenta f,, f.,... be numbered in the sequence of, say, the 
right-hand ends of their lines. Then the various diagrams of a given order will differ 
in the sequence of the left-hand ends of the photon lines. In each Feynman integral, 
we neglect terms in the numerator and denominator as in (135.5), and then treat the 
numerator in the same way as in the derivation of (135.11). Then the sum of all the 
diagrams having n photon lines, giving the term <a” in IT, is 


Pe") = 4" Catlla y1,, (136.2) 


_ i d*fi...d"fr 
" 2, 2(pifs) + 2(pifi + pife)... 2(pifit- +++ pifn) + 2(prfr)...2Qpofit- +++ prof fifa... fa 
(136.3) 


where the sum is taken over all interchanges (permutations) of the subscripts k in 

the products p2f,; the terms i0 and A’ in the denominators are omitted for brevity. 
It is clear that, if the subscripts k in the products p;f, in the sum (136.3) are 

interchanged in any manner, this is simply equivalent to renaming the momenta and 

so does not affect the value of I,. We can therefore extend the summation in (136.3) 

to all interchanges of the factors f, in both pf, and pif,, and divide the result by n!. 
We now make use of the important formula 


! edt gait 
Ga Gare am a a ae (136.4) 


where the sum is taken over interchanges of the subscripts 1, 2,...,n.f When this 
formula is twice applied to the sum of integrals, we obtain a product of n identical 
integrals of the form (135.19) (or (135.26)), so that 


I, =— Ih. (136.5) 


Substitution in (136.2) and summation of I” over all n =0, 1, 2,... gives finally 
(po, pis q) = y" explie*th/27°). (136.6) 


In particular, substitution of I, from (135.22) gives the double-logarithmic 
asymptotic form of the vertex operator with virtual external electron lines: 


64 


I’ (po, pis q) = y" exp | 5, 108 d 


q q 
Pi 5 


lo 
e P2 


}, (136.7) 


Ja’1> [D3], [p3]> mm 


(V. V. Sudakov, 1956). 


+ The formula is obviously true for n = 2, and can easily be proved by induction. 
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Substitution of I, from (135.29) gives the asymptotic form of the vertex operator 
for real external electron lines: 


2 2 
I“ (po, pis; q) = y" exp| — (Iog? 24+ 4 jog 4 og alt (136.8) 


la|> pi=pi=m’. 
The factor which distinguishes this * from its unperturbed value y” defines also 
the difference between the amplitude for electron scattering in the external field 
and its Born value. The scattering cross-section is therefore 


2 2 
dep dances { - a (log? Ia] digg a log al (136.9) 


To eliminate the infra-red divergence, we still have to multiply this expression 
by the sum of the probabilities for the emission of various numbers of soft photons 
with energy not exceeding some small value w,,,, Le. by the quantity (see (122.2)) 


® max 


iz | dw, +35 | dw, | dw, ++-=exp} | dw} (136.10) 
0 0 0 0 


The integral in the exponential is given by (120.14) (as the expression which there 
multiplies do.,), and the final result is the following asymptotic formula for the 
cross-section for scattering of an electron with energy e« at a high momentum 
transfer: 


2 
do = dog exp{ ~2 tog 1} tog \, (136.11) 


E 
@ max 
lq’7|>m*, (a/27) log’(e/m) ~ 1 


(A. A. Abrikosov, 1956). The first-order term (with respect to a) in the expansion of 
this expression 1s, of course, (122.12). 

Note that, if we put wmax~e, one of the logarithms in (136.11) becomes of the 
order of unity; that is, the double-logarithmic corrections cancel in the cross- 
section for simultaneous emission of photons of any energy.f In the approximation 
used, the exponential factor in (136.11) then becomes unity, and the cross-section 
has its Born value, in agreement with the general statement at the end of 898. 


§ 137. Double-logarithmic asymptotic form of the 
electron—muon scattering amplitude 


As an example of the second kind, let us consider the scattering of an electron 
by a negative muon, taking only the case of scattering exactly backwards through an 
+ For scattering through a finite angle, the condition formulated in §98 for the photon to be soft 


requires only that wmax <e, and so the formulae derived here can be used with logarithmic accuracy 
even when @max ~ €. 
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angle @ = 7 (V. G. Gorshkov, V. N. Gribov, L. N. Lipatov and G. V. Frolov, 1967). 
This is a simple process in two respects. Firstly, exchange diagrams do not appear, 
because the two particles are not identical. Secondly, in back-scattering there is 
very little emission of soft photons, and therefore no infra-red divergence: accord- 
ing to (98.8) the soft-photon emission cross-section is 


dwdo, Hess 


! ' 2 

do = «| (Yt — a ee ee at) xal : 
1 An “w 

(137.1) 


=—vorm. Levin fey, nm. Laveen 


where y,, v, and vi, v, are the particle velocities before and after the collision; in 
the ultra-relativistic case equality of momenta implies equality of velocities, and to 
this accuracy, in the centre-of-mass system, for back-scattering, v. = —v, = —V.= 
v,, so that (137.1) is zero. 

If the scattering process considered corresponds to the s channel of the reaction, 
in the t channel it becomes the conversion of an electron—positron pair into a wp 
pair. In this channel the condition 6 = 7 signifies that the directions of motion of e7 
and yw (and of e” and pz”) coincide. The elimination of the bremsstrahlung in this 
channel is particularly clear, since the direction of motion of the charge of each 
sign is unchanged. 

The cancelling of the leading terms in the emission cross-section has the result 
that its asymptotic form does not contain double-logarithmic corrections. Cor- 
respondingly, there is (with the same double-logarithmic accuracy) no infra-red 
divergence even on integration over the momenta of the virtual photons in the 
scattering amplitude. 

If the process is described by means of the invariants 


s=(pe+p,), t=(Pe—pd), u=(pe—DP,); 
the values corresponding to back-scattering in the ultra-relativistic case are 
s=-t>m,, u=0. (137.2) 


In the first approximation (with respect to a) of perturbation theory, electron— 
muon scattering 1s represented by the diagram 


PG rtm, Pe 
I 
{Pe—p: (137.3) 
Pi RRS aun! Sa eER et 
The corresponding amplitude is 
M‘ — Ama a" y"u (uO y,u). (137.4) 


[ 


The value of this expression in the limit (137.2) is obtained by replacing the matrix 
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4-vector y” by its “projection” y; on a plane perpendicular to the p,p; plane (or, 
equivalently, the p,p, plane, since in ultra-relativistic back-scattering p, ~ p,, and 
Dp. ~ p,): the components parallel to the p.p, plane are the matrices 


1 ? 1 ? 
ee (yDe + YP e)s We (yPe — YP oe) 


(the first being equal to y° and the second to n,. - y, with n, a unit vector along p.), 
and on using Dirac’s equations for the bispinors u“ and u™), we have 


a’ ypu) Ga” y ju) 203 1/s, 


so that these terms may be omitted. 
In the next approximation we add the diagram 


I Pet pP.f (137.5) 


and a diagram with the photon lines “‘crossed’’, which is conveniently taken in a 
form which differs from (137.5) only as regards the direction of one of the 
continuous lines: 


! 
fp; pe~f 
Pe (137.6) 


An analysis of the corresponding integrals shows that in both diagrams we have 
double-logarithmic contributions from the regions of “soft”? virtual photons: |(f — 
De) |<miz or |(f — p)’|<m%. These contributions arise from the infra-red diver- 
gences of the integrals and, according to the foregoing discussion, must certainly 
cancel here. In the diagram (137.6), however, there is a double-logarithmic con- 
tribution also from large momenta: |f*|> m2, and this contribution must be cal- 
culated. 

The diagram (137.6) corresponds to the integral 


M? = _ ia [ay Of t+ me)yu ay vf + m,)yu™) d’f, (137.7) 
(pe - f) (f? — m2)(f? — m+,)\(p. — f) 
where we we have already used the fact that p, ~ p),. We again put 
f=up.+vp.et+fi (137.8) 


(cf. (135.13)). The double-logarithmic contribution comes from the range defined by 
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the inequalities 


|su|,|sv| > p> mi, 


m2/s <|ul, |v] <1, (137.9) 


where p = —f{. The 4-vector f, is defined so that f:p. = f,p/= 0; in the present case 
of back-scattering, it follows that f? = 0 in the centre-of-mass system, and so p = f?. 

In the numerator in (137.7) we can neglect m, and m,, as well as all terms 
containing u or v; the factors u and v there would cancel the corresponding poles 
in the denominator (see below), and so the required squares of logarithms would 
not occur. Since 


(pi-fY ~tu~—su, (p.-fyY~—sv, f? ~suv—p, 


we can transform the element of integration d*f in accordance with (135.16) and 
rewrite (137.7) as 


ane —(e)r ov (e)yp ley nA (u) 
 _ _ lat f lh" y’ Of) ye ey" of) yu] 2 
Mii 55 | su - sv(suv —p +i0) Serena 


The numerator in the integrand is further transformed by averaging over the 
directions of f, and replacing y” and y* by yi and y{ (on the same principle as in 
(137.4)). A simple calculation gives 


MP = MSs, 
yo = — tae | p du dv dp (137.10) 
4m? J uv (suv — p + i0)” 


Finally, using in the numerator the identity p =(p — suv)+ suv, we can omit the 
second term, which would cancel simple poles and therefore give no double- 
logarithmic contribution. Thus 


yo ee ia | du dv dp 


Seo ee 137.11 
4m° J uv (p— suv — i0) Cn) 

This integral has the same form as (135.20), and the integration over dp can 
therefore be performed in the same manner, but since now p >mz, we have the 
condition suv > m;, instead of suv >0. The result is 


(137.12) 


the range of integration being defined by the inequalities 
mils<u,v<1, suv>mz;; 


in the calculation with logarithmic accuracy, the strong inequalities > are replaced 
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by > simply. A straightforward calculation gives 


Qa 


(Dis 
J At 


log? = (137.13) 


iv 


In the higher approximations of perturbation theory, the desired terms 
~a" log*"s arise from “ladder” diagrams similar to (137.6) but with a larger number 
of “rungs”. The complete double-logarithmic asymptotic form of the scattering 
amplitude is therefore given by the infinite sum 


Pe yma P, —_—— a 
iM, = + . ou a bE Gee (137.14) 


To determine the general form of the terms in this sum, let us consider the 
diagram for the third approximation (the third term in the series (137.14)). The 
corresponding integral may be written 


M® = MPVs, 
jou (-) [ —terder die dea (137.15) 
QT U,V, (Uy + Ur)(v4 oe V2)’ 


with the range of integration 
2 2 
MiS SUy2, Vi2< 1, SUyv1, SUQV2 > Mj. 


The double-logarithmic term in this integral can be separated by applying to the 
variables of integration the further conditions 


V2> V1, U2> Uy. (137.16) 
Then 


j® = (¢-) du,dv,du,dv> 
277 U,U2V1 02 


ds 
=(-) | dé dm dé dns, 


where & =log(suj/m2), 7; = —log v;, and the range of integration is defined by the 
inequalities 


&>m, &>m oc >&m>), o = log(s/m?). 


Similarly, the nth term of the series can be written as Mj? = M$)J™, where 


Jo) = (=) | de Ohi. cde dan (137.17) 
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with the range of integration 
Ee 1,2, 20M) OSGi U- (137.18) 


The total scattering amplitude is 
Mj = M$? [ +> I™o)]. (137.19) 
n=1 


To calculate this sum, we use auxiliary functions A” (& 7), given by the same 
integrals (137.17) but with ranges of integration 


Et = Aplissaght);, -Gesg 20, Ho Be0 (137.20) 


(i.e. with different limits of integration with respect to &, and 7,, instead of uniform 
limits as in (137.18)). It is evident that M,; = M$) A(o, 7), where 


A(é 7) = > AE q), A =1. (137.21) 


The definition of the functions A” (é, 7) shows that they satisfy the recurrence 
relations 


AE, n= | dgiamA" ME nd, 


and summation of this equation with respect to n from | to & gives an integral equation 
for the function A(€, 7): 


A(é,n)=1+-— | A(é, 1) déidm, 
| (137.22) 


E>m, E>&>0, nom>d. 


For the subsequent analysis it will be sufficient to consider A(é, 1) in the range 
—€ >. Then equation (137.22) can be written 


n & 
AG m= 145% [ [ A, m0 dé dm. (137.23) 


0 n 


Differentiating this with respect to 7, we have 


dA(E, m) _ 


é 
Qa 
a | A(é:, 0) d&i, (137.24) 


QE4 - OO 
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and a further differentiation with respect to € gives the differential equation 


vA a, 
Sane” Oe A=0. (137.25) 


This has to be solved with the boundary conditions 
A(é,0)=1, [dA/dn]z-, = 9, 137.26) 
which follow immediately from (137.23) and (137.24). 
The solution can be found by means of a Laplace transformation with respect 


to &: 


AE n= 55 | €Q@, 0) ap, (137.27) 


Cc 


where the contour C in the complex p-plane is a closed curve around the point 
p =0. Substituting (137.27) in (137.25) and equating the integrand to zero, we have 


pP-2Q Q= oe, 


with @(p) an arbitrary function. The first boundary condition (137.26) now gives 


= 
$0) =— + (Dd), 


with w(p) an analytic function, having no singularity within C. The second 
condition (137.26) can be satisfied by putting w(p) = —27p/a: then 


dA = — de &(p+a/2mp) a 
Ee ~~ 2nié J dp a 
Cc 


Combination of these expressions with € = n =o gives 


| 2n 


2Tl aT 


A(oa, a) =— | p +p er7Ptal2mp) dp. 
Cc 


Finally, integrating by parts and using the familiar formula 


I,(z) = | o22(P+llp) dp 
C 


(where I,(z) = —iJ,(iz) is the Bessel function with imaginary argument), we find the 
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scattering amplitude 


My = M3? \24 I, ( “8 c). (137.28) 


The cross-section for scattering through @ = 7 is correspondingly 


27 2 S ) 
—~7,-)_ #7? oa 
do = do HOE! 5 Ti( 7 log 2 : 


(137.29) 
do = (2107/5?) dt 


being the cross-section in the Born approximation in the ultra-relativistic case (see 
§81, Problem 6).t 


+ References to further literature on double-logarithmic asymptotic forms are given in the review 
article by V. G. Gorshkov, Soviet Physics Uspekhi 16, 322. 1973. 


CHAPTER XIV 


ELECTRODYNAMICS OF HADRONS 


§ 138. Electromagnetic form factors of hadrons 


SO FAR, we have been discussing in this book the quantum electrodynamics of 
particles not capable of strong interactions (electrons, positrons and muons). There 
are also many particles known as hadrons,{ which take part in strong interactions. 
These include, for example, protons and neutrons with spin 3, pions with spin zero, 
and other particles. Atomic nuclei, which consist of protons and neutrons, are of 
course hadrons also. 

Present-day theory does not enable us to derive a complete electrodynamics of 
hadrons. It is clearly impossible to set up equations which determine the elec- 
tromagnetic interactions of hadrons without taking account of the considerably 
more powerful strong interactions. In particular, the latter must be included in 
order to obtain the explicit form of the hadron current, in order to describe the 
interactions in quantum electrodynamics. The hadron current will therefore be 
introduced as a phenomenological quantity whose structure is determined only by 
the general kinematic requirements independent of any assumption about the 
dynamics of the interactions. 

The electromagnetic interaction operator will again have the form 


e(JA), (138.1) 


where the current is now denoted by the capital letter J to distinguish it from the 
electron current j. Since the order of magnitude of this interaction is specified by 
the same elementary charge e, we can again use the methods of perturbation 
theory.§ 

Let us first establish the form of the transition current between two states of a 
hadron in free motion (without any transformation of the hadron itself). This 
current occurs in the three-ended diagram 


i 
9 


(138.2) 


Pz P, 


+ From the Greek hadros, ‘‘large, massive’. 

t Topics of hadron electrodynamics which involve the quark model will not be discussed in this 
book. 

§ In this chapter, e (>0) denotes the unit charge. 
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which itself may be part of a more complex diagram (for example, that for elastic 
electron scattering by a hadron). The broken line in the diagram (138.2) represents a 
virtual photon; it cannot correspond to a real photon, since a free particle cannot 
absorb (or emit) such a photon; and 


q’ =(p2— pi) <0. 


If we take first a hadron with spin zero, let u; and u, be the wave amplitudes of 
the initial and final states of the hadron, in which its 4-momenta are p, and p2; fora 
spin-zero particle these amplitudes are scalars (or pseudoscalars).t The hadron 
transition current J; between these two states must be bilinear in u; and ut. We 
can write it 


Ji =a usD uy, (138.3) 


where the 4-vector I 1s an unknown vertex operator (the circle in the diagram 
(138.2)). If we put u, = u,= 1, then J; =T-. 

The conservation of current is a universal property in electrodynamics, due to 
the gauge invariance of the theory. In the momentum representation, it is expressed 
by the orthogonality of the transition current and the photon 4-momentum gq = 


P2— Pi- 
qJ;, = 0. (138.4) 
Here, this means that [ must have the form 
T = PF(q’), (138.5) 


where P = p,+ p2 and F(q’) is a scalar function of q’, which is the only invariant 
independent variable. Since the type of hadron is unchanged by the transition, 
Pi = pi=M’, where M is the mass of the hadron, and hence Pq = 0. 

The matrix elements (138.3) with T given by (138.5), and therefore the operator 
J, are true 4-vectors. The interaction operator (138.1) is consequently a true scalar. 
Thus the electromagnetic interaction of spin-zero hadrons is necessarily P-in- 
variant. It is also T-invariant. Time reversal interchanges the initial and final 
4-momenta, leaving the sum P =p,+p, unaltered, and changes the sign of the 
space components of the 4-momenta but not the time components. This is also the 
way in which the components of the 4-potential A are transformed, so that the 
product JA is unaltered. 

The invariant function F(q’) is called the electromagnetic form factor of the 
hadron. The explicit form of this quantity cannot, of course, be established in a 
phenomenological theory, but it is real (in the region q*<0O at present under 
consideration), as follows from the same arguments as were used in $116 for the 


+ The plane wave is written in the form  =[u/V(2e)]e ”*. The normalization to one particle per 
unit volume corresponds (for spin-zero particles) to the normalization of the scalar by u*u = 1, and we 
can take simply u=1 (§10). In the following we define the transition current with respect to the 
amplitudes u1, u2, in accordance with the notation used in §64. 
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electron form factors: when q*<0, there are no intermediate states which could 
appear on the right-hand side of the unitarity relation, and so the matrix M,, (and 
therefore J;;) 1s Hermitian. 

If q =0, the initial and final states are the same, and J;; becomes a diagonal 
matrix element. In particular, e(J°);/2e; = eF(O) is the charge density, which is 
equal to the total charge Ze of the particle because of the normalization to one 
particle per unit volume. 

For an electrically neutral particle, F(0) = 0, but it must be emphasized that this 
does not imply a strictly neutral particle. If a particle is strictly neutral and has a 
definite charge parity, F(q*) =0 for all q’; since the current operator is charge-odd 
($13), its matrix elements between two states of the same hadron are zero.f 

Let us now go on to hadrons with spin 3. In this case the wave amplitudes uy, uy 
are bispinors, and the hadron current is 


Ji = ol uy. (138.6) 


From the bilinear combinations of &#) and u,; and the 4-vectors p; and p2, both true 
4-vectors and pseudovectors (satisfying the condition (138.4)) can be constructed. 
Hence the condition of P invariance of the interaction is not necessarily satisfied, 
and must be imposed separately.t As has been shown in $116, under this condition 
the vertex operator contains two independent and (if q? <0) real form factors. We 
shall now write it as 


pb 
lr’ =2M(F, - Fy)ptt Fy" 


2 
_ _@ gp \P hE ww 
=2M(F, jap) pe IM? & 


_ 2p _ 22 y" 2M ae yy 
(4M F, q Fin) pat spr (Fe m)O qi» (138.7) 


where F,(q’) and F,,(q”) are invariant form factors (M being the mass of the 
hadron). It is easily seen from the equation P* + q* = 4M’ and (116.5) that the three 
expressions in (138.7) are equivalent.§ 


+ This does not, of course, mean that such a hadron has no interaction with an electromagnetic field. 
The product of two current operations, J(x)J(x’), is charge-even, and its matrix elements are non-zero 
for transitions between states having the same charge parity. Thus a strictly neutral hadron can scatter a 
photon or simultaneously emit two photons, i.e. can take part in processes of a higher order in a. 

+t We shall not consider possible violations of parity conservation in electromagnetic interactions in 
consequence of virtual weak interactions. 

§ The convenience of defining the form factors as in (138.7) (R. Sachs, 1962) will be shown below. In 
the literature, use is also made of form factors F; and F2 defined similarly to f and g in (116.6): 


Tr?’ = Fyy" - ao” Oy. 


These are related to F, and F,, by 


F. = F,+ Foq’/4M°, Fm = F + Fo. 
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The electromagnetic form factors are among the invariant amplitudes defined in 
870. They may be regarded as the amplitudes of a “reaction” which is (in its 
annihilation channel) the decay of a virtual photon into a hadron and an antihadron. 
The virtual photon is a “particle” with spin 1. The fact that its decay into two 
particles with spin 3 must be described by two independent amplitudes is easily 
seen by calculating the corresponding helicity amplitudes (A,A,|S"]A,) (see 869): the 
P invariance means that the four non-zero elements of the S-matrix must be equal 


in pairs: 


(3]S'|1) = (~2—3|S'|- 1), 
(2— |S'|0) = ¢ — 23]S"0). 


The requirement of T invariance (or C invariance in the annihilation channel) 
imposes no further relations between these elements. This is connected with the 
fact that the interaction described by the vertex operator (138.7) is necessarily 
T-invariant also (but this does not apply for particles with higher spins). 

When q — 0, the terms of zero and first order (in q) in (138.7) are 


P* = Fy" 577 [Fn(0) — F.O)]o""G, (138.8) 


Hence it follows (see $116) that F.(0) = Z is the electric charge of the particle (in 
units of e), and F,,(0) — F,(0) is its anomalous magnetic moment (in units of e/2M).f 

So far we have used only form factors in momentum space. This is, of course, 
sufficient to describe the observed phenomena. Purely as an illustration, however, 
we Shall give a somewhat more intuitive interpretation of the form factors, 
regarding them as the Fourier transforms of certain functions of the coordinates. 
To do this, it is convenient to take a frame of reference in which P= p+ po =0 
(called the Breit frame); this is always possible, since P? >4M*> 0. In this frame, 
€,= €) =e, so that P°=2«, and the components of the 4-vector q are q°=0, 
q = 2p. = — 2pi. 

For a spin-zero hadron, the transition current in the Breit frame has a particularly 
simple form: 


Sii2e=F(-q), IJ=0. 


From this we see that F(—q’) may be interpreted as the Fourier transform of a 
static distribution of charges with density 


ep(r) =e any | F(—q’) e'** d3q. (138.9) 


In this sense, the particle is said to have a spatial electromagnetic structure: when 
F = constant = Z, we have p(r) = Zd(r), and the dependence of the form factor on q 


+ For example, the proton has F,(0) = 1, F,(0) — F.(0) = 1.79; the neutron has F.(0) = 0, Fin (0) = —1.91 
(the magnetic moment being entirely ““anomalous’’). 
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is interpreted as the difference between the charge distribution and a point charge. 
It must be stressed, however, that this interpretation is not to be taken literally. The 
function p(r) does not relate to any particular frame of reference, since there is a 
different frame for each value of q. 

The Breit frame is the same as the rest frame of the particle, and independent of 
q, only in the non-relativistic limit of small q’ << M7’, when the change in the particle 
energy in scattering is negligible. The initial and final states of the particle are the 
same in this approximation, and so the transition current becomes a diagonal 
matrix element, with p(r) the actual spatial distribution of charges. For the 
elementary particles, however, the values of {q| for which the form factors vary 
considerably are only slightly less than M. In the non-relativistic limit for these 
particles, we can therefore replace F(—q’) by F(0), i.e. regard the particle as a 
point. The situation is different for nuclei. The mass M of a nucleus is proportional 
to the number A of nucleons in it, and the typical value of |q|~ 1/R, i.e. is 
proportional to A~'? (R being the radius of the nucleus). Hence, for sufficiently 
heavy nuclei, the typical q’«<M7’, and so the non-relativistic treatment is per- 
missible throughout the significant range. Thus the concept of the electromagnetic 
structure of the nucleus becomes a quite definite one. 

For a spin-} particle, (138.7) gives in the Breit frame 


M__ : 
Si =( Po Fy) oe (fi2U,) + Fy (izy?U)) 
= F,(iiry’u), (138.10) 


If Fig X (dZu)), (138.11) 


a 
2M 


where » is the three-dimensional spin operator (matrix) (21.21), and in (138.10) the 
equation 


€ (ti?) = M (iin) 


has been used; this is easily verified by means of Dirac’s equations for u, and i, 
with Pi — —p2. 

The time component of the transition current (138.10) differs from the 
expression for a “point particle’ (an electron) by a factor F,(—q’). We can therefore 
say that the form factor F, (called the electric form factor) describes the “‘spatial 
distribution of charge”’ in accordance with (138.9). 

Similarly, the three-dimensional vector (138.11) can be correlated with the 
‘spatial distribution” of the current density ej(r) = curl p(r), where 


p(r) = sar > Fn(—q’)e'*'' d°q 


is the “magnetic moment density’. Thus the magnetic form factor F, may be 
interpreted as the magnetic moment spatial distribution density, of course with the 
Same reservations as were expressed above with regard to the charge distribution. 
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F,, includes both the “normal’’? Dirac magnetic moment and the ‘‘anomalous’”’ 
magnetic moment specific to the hadron, the “density” of the latter corresponding 
to the difference F,, — F.. 

It is reasonable to suppose that the singular points of the hadron electromag- 
netic form factors, like those of the electron form factors, occur at real positive 
values of the argument t = q*= —q’. From this we can derive certain conclusions 
as to the asymptotic behaviour of the distribution p(r) (and p(r)) as r> ©. The same 
transformation of the integral (138.9) as was applied in §114 to derive (114.4) from 
(114.3) gives the following result for large r: 


p(r) xe, 


where «@ is the abscissa of the first singularity of the form factor F(q’); cf. the 
footnote to §114. If the nearest singularity is given by the threshold for the 
production of a pair of hadrons (each with mass Mo) by the virtual photon, then 
Ko — 2 Mo. 


§ 139. Electron—-hadron scattering 


The formulae derived in 8138 can be applied to the elastic scattering of an 
electron by a hadron. Let the initial and final 4-momenta of the hadron be p, and 
Ph, and those of the electron p, and p{; then 


De + Dh = PetDh. (139.1) 


The process is represented by the diagram 
tq (139.2) 


The emission of a virtual photon by the electron corresponds to the ordinary vertex 
operator y, and its absorption by the hadron corresponds to the operator I. 

Let us take the most interesting case, that of a hadron with spin 3 (for example, 
the scattering of an electron by a proton or neutron). The diagram (139.2) cor- 
responds to the scattering amplitude 


My = —4e? z (aty"u, (HIT ttn); (139.3) 


in this chapter, the electron charge is —e. The calculation of the cross-section from 
this amplitude is essentially the same as the calculations in $81; the operator I is 
conveniently taken in the form of the first expression (138.7). 
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For the scattering of unpolarized particles, the result is 


Sas ra’ dt ” 
Ts —(M + my ][s — (M — my} — t/4M2) 
x | F2(s ~ uy + (4M2—- Dt] ayTYE F2[(s — u)?—(4M2—1)(4m? + if (139.4) 


where M is the hadron mass and m the electron mass, 


s=(pe+prayY’, t=q=(p—-py, u=(pe—-pi, 
sttt+tu=2m’*+2M°’. 


The following are some limiting cases. 

For the scattering of an electron by a heavy nucleus, an important case is that in 
which the momentum transfer |q| from the electron to the nucleus is small 
compared with the mass of the nucleus, but not small compared with 1/R (where R 
is the radius of the nucleus), so that the nucleus cannot be regarded as a point. In 
this case, the centre-of-mass system approximately coincides with the rest frame of 
the nucleus, the recoil of the nucleus may be neglected, and the electron energy is 
unchanged. Then 


—t=q <M’, — aldt|=pidol, 
s—_M’?=M’*—-u=2Me,, 


and formula (139.4) becomes 


ado! 


do = ra (4e2— q’)FA-q’). (139.5) 


In this approximation, the cross-section has only the term containing the electric 
form factor, and (139.5) corresponds to formula (80.5), which applies to the 
scattering of an electron by a static charge distribution. 

In the scattering of an electron by a neutron at rest, in the same limiting case 
&- <M (where M is the mass of the neutron), the form factors can be replaced by 
their values for q=0, since, as already mentioned, for a single nucleon the 
characteristic “radius” of the charge distribution is comparable with 1/M.7 Since 
the neutron is electrically neutral, F,(0) = 0, and the cross-section becomes 


OA. oD 
do = ap’ jas 1] dot 
= ap(cosec’ 78 + 1) dot, (139.6) 


Where p =(e/2M)F,,(0) is the magnetic moment of the neutron and @ is the 


+ The empirical value of the r.m.s. “radius” of the nucleon is about 3.5/M ~ 1/2m, (where m, is the 
pion mass). 
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scattering angle. This formula corresponds to the scattering of an electron by a 
point magnetic moment at rest. 

Finally, we shall give the cross-section for the scattering of an ultra-relativistic 
electron by a nucleon, with |q/>m. As before, q* denotes the square of the 
momentum transfer in the centre-of-mass system, and hence the invariant t = —q’. 


In the rest frame of the original nucleon (the laboratory system), we have 
—t ~2(p.p.) = 2e-e {1 — cos 9), 


where e, and e, are the initial and final energies of the electron, and 3 is the 
scattering angle in this system. In the ultra-relativistic case, eis related to 3 by the 
same formula as in the scattering of a photon (cf. (86.8)): 


ae, 
a a cos 0). 


Hence 


. I 
4e2 sin? 50 


~! = TF (2eJM) sin? $0” al 
d|t| = Eedoe (139.8) 
me TE Qed M) sin’? 3)” ' 


where do, = 27 sin 8 dd. In formula (139.4), we can everywhere omit the electron 
mass m; expressing all quantities in terms of t and s -M”=2Me,, we have 


do = EO pM | ~ 


ect? 4M’-t 
tt pa, ((4Me.t+ t) _ |} 
ag Fao |Gae—t |b (139.9) 
or, using (139.7) and (139.8), 
so = do ofa ____, 
“4e2 sin*s 0 14 (2e,/M) sin? 9 


2 wy p2 
. eee aqqi Patan’ 10} (139.10) 


1—t/4M? 


(M. N. Rosenbluth, 1950). 

Note that the form factors F, and F,, contribute independently to the cross- 
section, and there are no interference terms between them. This shows that the 
form factors have been appropriately chosen. 
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PROBLEM 


Find the cross-section for scattering of an electron by a hadron with spin zero. 


SOLUTION. Using (138.5), we have instead of (139.3) 
Ame? _ 
My, = ~~ Tr (i yPr)ue)F(q?). 


The cross-section is found to be 


wa’ dt[(s —u)+(4M’—-t)t] 2» 
40 is (M+ mylls—(M—myye 


with the same notation as in (139.4). When |t|> m/’, 


‘ a” cos?30 F’(t) 


o> Genet aaa T: (2¢-/M) sin?390 


with the same notation as in (139.10). 


§ 140. The low-energy theorem for bremsstrahlung 


In §98 we have investigated the emission of a photon in a collision of particles, 
in the limit of zero photon frequency, and found that the amplitude of the process 
is inversely proportional to w and can be simply expressed in terms of the 
amplitude for the same collision without emission of a soft photon; the latter will 
again be conventionally referred to below as the amplitude for “‘elastic”’ scattering 
and denoted by M§?”. In the next approximation with respect to w, 


My = M$? + M}?, (140.1) 


where a correction term independent of w (« w°) has been added to the principal 
term («w'). We shall see that this correction term also, like the principal term, 
can be expressed in terms of M‘’, and that this is true whatever the detailed 
electromagnetic structure of the hadron. This result is called the low-energy 
theorem for bremsstrahlung (F. E. Low, 1958). 

We have seen in §98 that the main contribution to the amplitude for emission of 
a soft photon (corresponding to the first term in (140.1)) arises from diagrams in 
which the photon is emitted by the initial or final particle. These are diagrams of 
the form 


(140.2) 
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in contrast to those of the form 


(140.3) 


where the photon line comes from the internal parts of the diagram. A charac- 
teristic feature of the diagrams (140.2) is that they can be divided into two parts by 
cutting only one (initial or final) virtual-hadron line. Thus they illustrate an 
important property: there exists a one-particle intermediate state with one hadron. 
We have seen in §79 that, because of the unitarity conditions, this property 
necessarily causes a pole singularity of the amplitude. 

Let us assume for simplicity that only one (denoted by the subscript 1) of two 
colliding hadrons has an electric charge and therefore can radiate, and that neither 
hadron has any spin. The wave amplitudes u of such hadrons are scalars, which 
will be taken as unity. Then the contribution of the pole part of the diagram 
(140.2a) to the amplitude is 


1 
NA (2) — * ee ae : 
iM‘? = V(4r)e*(2pt —k") eF (=k iT. (140.4) 


The first factor corresponds to the photon k (e, being its polarization 4-vector). The 
second factor corresponds to the electromagnetic hadron vertex (the black dot in 
the diagram), and is written in the form (138.5), with F the hadron form factor. The 
third factor is the propagator of the virtual hadron p,;—k (M being its mass). 
Finally, the factor if! denotes the whole remaining section. This differs from the 
amplitude of the elastic process 


P Py 
iM“. = (140.5) 
P; Ps 


in that the real hadron p,; is replaced by the virtual hadron p, — k. 

The first few terms in the expansion of (140.4) in powers of w include (1) terms 
inversely proportional to w, (2) terms independent of w but depending on the 
direction of k, (3) terms independent of both w and k. The terms of the third kind 
only are given also by non-singular diagrams of the type (140.3), which do not have 
a pole singularity, and by the non-pole parts of the diagrams (140.2). We shall see 
that all such terms of this sort are jointly and unambiguously given by the terms of 
the first and second kinds when the condition of gauge invariance 1s applied, so that 
no separate calculation of these terms is necessary. 

The amplitude of the elastic process (140.5) depends only on two invariants: 


S = (pit po) = (pit ps)’, 


140.6 
t = (p5— po). 
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The replacement of p,; by p;— k not only changes s into (p; — k + p2)’ but also brings 
in a dependence on a further variable, 


(pi— k)’— M’ = — 2pik, 


which represents the “non-physicalness”’ of the momentum p,—k. But the first 
term in the expansion in powers of this new variable (a small quantity) already 
eliminates the singularity in the amplitude (140.4), and therefore can yield in this 
amplitude only terms independent of k, which according to the foregoing discussion 
are not yet relevant. Thus we reach the important conclusion that I in (140.4) can 
be replaced by the physical amplitude M‘"(s, t) with the change 


s > (pi t+ po—k’) = s — 2k(p, + p>). (140.7) 
The first terms in its expansion are given by 
P > Ms, t)— 2(kp: + kp2)(aM/as),. 


For a similar reason, it is unimportant that the electromagnetic form factor F 
here relates to a vertex at which only one of the two external hadron lines (p; and 
p1—k) 1s physical. The form factor can therefore be replaced by the one described 
in §138, for a vertex with two physical external lines; since the photon k is then a 
real photon, we have F(k’) = F(0) = Z;, where eZ, is the hadron charge. 

Thus (140.4) gives 


* el) 
MIP = ZeV der) Sy GPd — ZeV Aer) Xe* pi) yep AUP) OMI ee, (1408) 


where the dots represent terms independent of k (whereas the second term in 
(140.8) depends on the direction of k). Similarly, we find that the contribution to M,; 
from the diagram (140.2b) differs from (140.8) in that p;, p2 and k are replaced by 
Pi, p2 and —k. The leading term in the expansion is the already familiar expression 


(cf. (98.5)) 


! yn * * 
MS” = ZieV(4e) Gia : re) M®). (140.9) 


The terms independent of k can be found from the condition for the amplitude 
as a whole to be gauge-invariant: it must be unaffected by the change e* > e* + 
constant X k, i.e. must have the form M, = e*J", with k,J* =0. It is easy to see 
that, for this to be so, we must add to (140.8) the term independent of k 


—~2Z,eV (477)(p2e*), 
and similarly for the diagram (140.2b). The final result is 


os 


Mi =2ZieV(4myek | pt 2 — pe + pin PED — py (140.10) 


(pik) ' (pik) 
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The problem can be solved by means of this formula, which may be more 
compactly written by using the identity 


2v Vv 
as Op) ae P2,P2 


and similarly for 0/dp;, and the differential operators 


ly = 745 res (140.11) 
and similarly for di,. Then 
M = Z,eV (47) eX(dt + di") M@. (140.12) 
The cross-section is given by |M,i|*; to the appropriate accuracy, 
|Myi\’ = |M&?P +2 re (M$, 9M }*). (140.13) 


The second term gives the required correction to the emission cross-section. 
Summation over the polarizations of the photon gives as the value of this cor- 
rection 


—4a(Zye) (J -7) (dj +d,), |M/. (140.14) 


Thus the correction to the emission cross-section is expressed in terms of the 
cross-section for the elastic process and its derivative with respect to s. 

If the charged hadron has spin 3, the calculations are unchanged in principle; 
only the specific form of the vertices and propagators is altered. It is found that 
formula (140.14) remains valid after averaging over the polarizations of the hadrons 
and the photon (T. H. Burnett and N. M. Kroll, 1968). 


§141. The low-energy theorem for photon—hadron scattering 


In the limit of low frequencies, the cross-section for the scattering of a photon 
by any charged particle at rest tends to its classical value given by Thomson’s 
formula. This limit corresponds to an amplitude independent of the photon 
frequency w, which we denote by M$. It is found, however, that not only the first 
term in the expansion of the serfiitudes in powers of a, 


M;: = M9? + M9. (141.1) 


but also the next term (M“” ~ w), are independent of the specific electromagnetic 
structure of the hadron for photon scattering, as well as for the bremsstrahlung 
discussed in §140 (F. E. Low, 1954; M. Gell-Mann and M. L. Goldberger, 1954). 
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This process is represented by diagrams of three types: 


aa ee ky ye via 
‘ ‘ ; : WA, 141.2 
= as JA (141.2) 
p p p’ ’ 
b P Pp . p 


of which the first two again have a one-particle intermediate state, and therefore a 
pole singularity. 

The analysis and the principle of the calculations are the same as in $140. In 
practice, we need only determine the contribution from the pole parts of the 
diagrams (141.2a) and (141.2b), expressing their electromagnetic vertices in terms 
of the static form factors (the charge Ze and the anomalous magnetic moment pan), 
as in (140.15). 

Unlike the bremsstrahlung case, however, the corrections to the Compton- 
effect cross-section are important only for particles that have spin. This is because, 
for bremsstrahlung, as well as the spin-dependent corrections, there are corrections 
arising from the energy dependence of the amplitude of the “elastic’”’ process. In 
photon scattering, this amplitude is replaced by form factors which, for “physical 
external lines’, are constants independent of the energy, and therefore the cor- 
reactions arise only from the magnetic moment, which is zero for spinless particles. 
We shall discuss the scattering of a photon by a spin-; hadron. 

If M;; denotes the contribution of the pole diagrams to the scattering amplitude, 
then (cf. (86.3), (86.4)) 


My = — 41(Ze)’e),*e,(iQ”’u), (141.3) 
where 
lip Heh a en, eT I ie. se Bs = TM Bie eas 
Qe NY ES) cogs (YS Sage YS) 41.4) 
s=(ptky=(p'+ky, u=(p—-k'y=(p'—ky, 
and for brevity we have put 
io ky= Les”. pao k= Zes'”. (141.5) 


By interchanging the operators yp+M and using the equations i'(yp’—M)= 
(yp — M)u =0, we can transform (141.4) to 


Qu = cy" i gn Oy" + 2p, y'(yk) = 2p” Gee s")] _ 


2pk 2p'k 
Pye Ok 2p ey Oe" | - 
TT) i 
fon Wt+yYK+M op a yp yk'+M | 
E CS se 8 (141.6) 
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This form, and the corresponding one with k and k’ interchanged, clearly show that 
(141.3) is gauge-invariant; the relevant condition is 


ka’ Q"’u) = (a'Q“’u)k, = 0, (141.7) 


in verifying which it must be remembered that (yk)(yk) = 0, kS = k'S' = 0. 

Since the pole part of the scattering amplitude is thus gauge-invariant by itself, 
so must be the regular part of the amplitude (which includes the contribution of the 
diagram (141.2c)). Hence in turn it follows that the expansion of this part in powers 
of k and k' must begin with quadratic terms; cf. the similar comment relating to the 
condition (127.5). That is, the regular part of the amplitude includes only terms 
starting with those proportional to ww'~w’, and makes no contribution to the 
terms concerned here, which are proportional to w° and w'. These are therefore 
included in (141.3). 

To calculate the terms in question, we use the laboratory system, in which the 
initial hadron is at rest. For the photons, we take a three-dimensionally transverse 
gauge, in which eg =e)=0. Then pe =0, p'e’* ~ |p'|~ , and from (141.6) it is 
obvious that the leading terms in the expansion of M;; will be proportional to w, 
and that the terms in p,, will contribute only to the w' terms. 

The wave amplitudes of the initial and final hadrons in the laboratory system 
are, with the necessary accuracy, 


u = V(2M) (e): i’ = V(2M) fw, FF kk) oF], 


where w and w’ are three-dimensional spinors. 
A straightforward calculation gives the result 


M$? = —87(Ze)*(e’* - e)(w'* w), (141.8) 


M$? = — 167iMw2,w(w’* ow): (n’ X e'*) X (n X e) — 
— 47iZewa,w(w'* ow) {n(n X e: e’*) + 
+(nXe)n:e’*—n'(n' Xe™- e)— 
—(n’ X e’*)n- e — 2e’* X e}, (141.9) 


where n=k/o, n’ = k'/w’. 
The scattering cross-section 1s 


a I 2 wo" r. 
do = 6453 Mii Mw? do . (141.10) 


see (64.19). For scattering by a charged particle, both M$) and M$? are non-zero. 
The accuracy used allows us to retain in |M,|’ the terms |M}?|’ and re(M% M$?*). 
The first of these gives the Thomson scattering. The second becomes zero on 
averaging over the polarizations of the photons and hadrons. In scattering by a 


QE4 - PP 
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charged hadron, therefore, the corrections under consideration occur only in the 
polarization effects. 

For scattering by an electrically neutral hadron, M$? = 0 and the cross-section is 
determined by |M}$)|’. After averaging over the polarizations of the final particles 
and summing over those of the initial particles, it is (in ordinary units) 


2 *w? 
a 


(2+ sin’? 3) do’, (141.11) 
where 9% is the photon scattering angle and the anomalous magnetic moment is 


equal to the total moment p. The angle dependence of this cross-section is the 
same as for antisymmetric scattering (see 860, Problem 2). 


§ 142. Multipole moments of hadrons 


Let us now consider the transition current corresponding to a diagram of the 
same kind as (138.2): 


fe (142.1) 
Ps t 


but with the lines p; and p> pertaining to different particles (masses M; and M)); the 
photon line k = p,—p2 will be more conveniently represented here as leaving the 
vertex. The photon may be either virtual or real, the only necessary condition being 
k*<(M,— M,)’, so that the value k? = 0 is permissible. Thus the applications of this 
diagram include, in particular, processes of photon emission in transformations of 
nuclei as well as other particles (for nuclei, the initial and final particles are the 
same nucleus in different states). 

The most interesting case here is that in which the wavelength of the photon is 
large compared with the characteristic “dimensions” of the particle (i.e. those 
which appear in its form factors, equal to the ‘‘radius’’ in the case of a nucleus). 
Then the transition current can be expanded in powers of k.t 

Note first of all that we must have 


J, = 0 when k = 0, (142.2) 


since the limit k > 0 corresponds to a potential constant in space and time, but such 
a potential has no physical significance and cannot give rise to real processes. The 
same conclusion can be reached by a more formal argument: the currents discussed 
in §138 were non-zero for k=0 on account of the terms proportional to the 
4-vector P = p,+p2, but when M,;# M; the product Pk#0, and such terms are 
therefore forbidden by the condition for the current to be transverse. 


+ The following treatment is due to V. B. Berestetskii (1948). 
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This condition for the current Jj; = (pp, Jf) is, in three-dimensional form, 


k - Jp = wpyi, (142.3) 
and can be satisfied in two ways: 
Ji = wv(k, w), pfi = k - V(k, w) (142.4) 
or 
Jj, =k X alk, w), pri = 0. (142.5) 


Here v is a polar vector and a an axial vector. The current is said to be of the 
electric and magnetic type respectively. According to (142.2), v and a are finite or 
zero when k, w—>0. 

Let the photon energy w <M,. Then the recoil may be neglected, and the final 
particle M, also may be regarded as being at rest (in the rest frame of M)); 
w = M,—M, Is given quantity. The states of the particles M, and M, at rest are 
specified by three-dimensional spinors w, and w, of ranks 2s; and 252, where s, and 
Ss. are the spins of the particles. The transition current must be a bilinear com- 
bination of w, and w%. From the products of the components of these spinors, we 
can form irreducible tensors with ranks | = s,;+ 52,...,|S;— S|; for a given I, they 
are true tensors or pseudotensors according to the internal parities of the particles 
M, and M). Apart from these tensors, we have available only the vector k. In order 
to obtain the first term in the expansion of the current in powers of k, we must 
form from these quantities a vector of the lowest possible power of k. This is done 
by taking the tensor of lowest rank and contracting it | — 1 times with the vector k. 
This will give the polar vector v or the axial vector a. 

Let Qim be the spherical components of the tensor formed from the wave 
amplitudes of the particles. The spherical components of the tensor of rank | —1 
formed from the components of k are |k|'"' Y;-;,n(n), where n = k/@. From the general 
rule for the addition of spherical tensors (see QM, (107.3)), the spherical components 
of the vector v may be written 


apy WA 2i+1, 4. 1-1 1 l 
v, =(-1)’ ST —— IkI 2 4 +m -—X ia ie ereAD: 


where A takes the values 0 and +1; the choice of the common factor 1s explained 
below. Using formulae (7.16), we can express v in terms of spherical harmonic 
vectors: 


I-1 
vai Ra Ep > CD Q-mIVI+ DYKE) + VIVine@)]. (142.6) 


Substitution in (142.4) gives the El transition current: 


_ 1 VGmolkl ayer : 
Jn= i! paver ny & OD "@PalV + DY ina) + VY in(a)], (142.7) 


r V (427) |k|' 


pri = 21—DIVQl+1 » (1) "Q 4? nm Yim Cn); (142.8) 
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|k| and w are distinguished in each formula, with a view to possible applications to 
real photons and also virtual photons, for which the two quantities are not equal. 

In (142.7) and (142.8) it is assumed that the spherical tensor Q,,, (here denoted 
by Q‘) is a true tensor. If it is a pseudotensor (denoted by Q{”), then (142.6) defines 
the pseudovector a, and substitution in (142.5) gives the MI transition current: 


1 W427) 1+1 


n=" OF pnViai+h 


KI > (DI Qin ¥ in'(n), 
| | > ( yn"Q? im (MN) (142.9) 


pri = 9. 


The quantities Q{ and Q{” are the hadron electric and magnetic multipole 
transition moments. Their role in hadron electrodynamics is exactly analogous to that 
of the corresponding quantities in electron electrodynamics. However, for electron 
systems these moments can in principle be calculated from the wave functions (as the 
matrix elements of the corresponding operators), whereas in hadron electrodynamics 
they occur as phenomenological quantities whose values are determined from 
experiment. 

The normalization of these quantities in (142.7)-(142.9) 1s chosen so as to agree 
with their definition in §46. This can be verified by regarding the currents (142.7)- 
(142.9) as Fourier components of the transition current in the coordinate represen- 
tation. For example, expanding the factor e ‘“’" in the integral 


pni(k) = | pa(rye ** d?x (142.10) 
by means of (46.3), we get 
p(k) = Ari! 5 Yin(a) | pyle) ¥tn(ele)gi(klr) d°x. 


Retaining the term with the smallest value of | such that the integral is non-zero, 
and replacing g, (|k| r) for |k|r <1 by the first term in its expansion (46.5), we return 
to (142.9), with 


Qi? = x i r'psi(r) Yim (r/r)d°x, (142.11) 


in agreement with the definition (46.7). 

It can also be shown that, when applied to the emission of a real photon, the 
formulae derived above give results already known. The amplitude of a transition 
with emission of a photon having momentum k= wn and polarization e = (0, e) is 


My = — eV (4rr)e* + Sj. (142.12) 
If the nucleus has definite values of the angular-momentum component M; and M,; 


in the initial and final states, only one term remains in each sum over m in 
(142.7)-(142.9), namely that with m = M; — M,;. Since, from (16.23), the products 
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Y{2-e* and Y(™ - e”* (where A = + 1 is the helicity of the photon, and e™ Ln) are 
proportional to Di, we obtain again the formulae given in §48. 
The differential emission probability ist 


dw = 275[o — (E; — Ep) |Mg|’a?k/2@(277)', (142.13) 


where E; and &, are the initial and final energies of the nucleus. The total 
probability is found by summation over the polarizations and integration over d°k. 
Substituting (142.7) or (142.9) in (142.12) and thence in (142.13), and performing the 
operations just mentioned, we again obtain (46.9) or (47.2). 

Fermulae (142.7)-(142.9) include all possible cases of the emission of a real 
photon. For virtual photons, there is another possible case which they do not 
include (R. H. Fowler, 1930). 

If the spins and parities of the initial and final states of the nucleus are the 
same, we can obtain from their wave amplitudes a scalar Qo, and from this a 
transition current of the form 


pri = Qok’, Js, = Qowk. (142.14) 


Qo is called the monopole (EO) transition moment. The corresponding transition 
amplitude for the emission of a real photon is zero, since e* -k =0. The monopole 
current, however, may give rise to transitions involving the emission of a virtual 
photon. It is, moreover, the only such source when s; = s2=0 and all the multipole 
moments are Zero. 

The monopole current (142.14) is analogous to the electric quadrupole 
current as regards its dependence on w and k. Accordingly, the moment Q) also is a 
quantity of the same order as the quadrupole moment. The same conclusion can be 
reached by regarding (142.14) as the Fourier components of the current in the 
coordinate representation. Using in (142.10) the expansion of e “’' in powers of 
k-r and assuming that p;\(r) is spherically symmetrical, we get 


pri(k) = —6k’ | palr)r? d°x. 
Comparison with (142.14) shows that 
Qo= -5 | pyi(r)r? d?x. (142.15) 
The similarity to the quadrupole moment is obvious. 


PROBLEMS 


PROBLEM 1. Find the probability of ionization of an atom from the K shell because of the 
excitation energy w of the nucleus (called internal conversion of y-rays), in an MI nuclear transition, 


+ The factor 278 in this formula, replacing (27)‘5 in (64.11), arises because momentum is not 
conserved when the recoil of the nucleus is neglected, and so only the conservation of energy remains. 


642 Electrodynamics of Hadrons § 142 


neglecting the binding energy of the electron in the atom and the influence of the nuclear field on the 
wave functions of the nucleus.T 


SOLUTION. The process is described by the diagram 
SO 
44 
ra 
" Pp 


where p; and p2 pertain to the nucleus at rest in different states, and p =(m,0) and p’=(m +, p’) are 
the 4-momenta of the initial and final electrons. This diagram corresponds to the amplitude 


(1) 


Pp 


4 
My = a re: ii(p')\(yJj)u(p), 


where J; is the transition current of the nucleus. After summation over the final polarizations of the 
electron, and averaging over its initial polarizations, we get 


16 
2 > IMil’= eT os E 7a" ePh) + 4p PhP) 
using the fact that Jjiq = 0 and therefore Jjp = Jp’. The conversion probability is calculated from 
dWeonv = 2|uh.(0)/ (2 do 
m p-0 


where do is the cross-section for the scattering process represented by the diagram (1) with p = (e, p), 
and y, is the wave function of the atomic electron; for a K electron |y,(0)|’ = (Zam) |. The factor 2 
takes account of the two electrons in the K shell of the atom. The cross-section do is 


Eat ' 2 d°p' . 
do = 278(e + w — &')|Myi| pl deny 
cf. the last footnote. 

For MI transitions, the current Jj must be taken from (142.9). The infceralion of dWeonyv over de’ 
im by unity. The conversion 
probability is thus expressed in terms of |Q{\™),|". But, according to (46.9), the probability w, for the 
spontaneous emission of a photon in the same nuclear transition can be expressed in terms of the same 
quantity. The final result is 


1+} 
Mooi 2 5 (7a) ™ (1+=2) 
Wy 


this ratio being called the conversion coefficient. 


PROBLEM 2. The same as Problem 1, but for an El nuclear transition. 


SOLUTION. By the same method, with the transition current given by (142.7) and (142.8), we obtain 


Weonv I m 2m ea 
Wy =2a (Za)*(1 +m) ] : 


+ This approximation implies that the nuclear charge is small and the excitation energies w are 
sufficiently large (but 1/w is assumed large compared with the dimensions of the nucleus). In practice, 
the approximation is somewhat unsatisfactory, and a more precise calculation has to take into account 
the Coulomb field of the nucleus. 
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PROBLEM 3. The same as Problem 1, but for a monopole nuclear transition. 


SOLUTION. With the transition current given by (142.14), the result is 


3/2 
Weonv = 1607(Za)?mew* (1 + “| |Qol’. 


0) 


Since monopole emission of a photon is impossible, |Qo|’ cannot be eliminated. 


§ 143. Inelastic electron—hadron scattering 


Elastic electron—hadron scattering has been discussed in §139. The problem of 
inelastic scattering may be formulated similarly. The only difference is that the final 
hadron state now corresponds to another hadron or several hadrons. The law of 
conservation of momentum (139.1) remains valid if p; denotes the 4-momentum of 
the final hadron or the total 4-momentum of the group of hadrons formed in the 
scattering process. We thus have now pj}, # p; = M’, where M is the mass of the 
initial hadron. 

With this difference, the inelastic scattering process is described by the same 
diagram (139.2). The lower vertex of the diagram is denoted by J; as in 8138. 
However, in contrast to (138.3) or (138.6), we shall not express the transition 
current in terms of the vertex operator and the amplitudes of the states, in order 
not to specify in advance the nature of the final hadron state. 

We can now write the scattering amplitude in a form analogous to (139.3): 


A4qre* = 
My = — os (Ue ee ) Ji; (143.1) 
f Coy Yu Ji 


a similar amplitude has already been used in §142, Problem 1, where energy 
transfer to an electron was considered, and the amplitude has a similar structure in 
the problem of the excitation of nuclei by electrons. 

We shall assume that the initial electron energy is so high that many hadrons 
can be formed in the final state, and consider the “inclusive” cross-section, for 
which only the electron momentum in the final state is fixed, with summation over 
all the hadron states. This differential cross-section is written as follows, in 
accordance with the formulae of §64: 


d*pi 


sa 41 (27) 2e), 


2 (27r)'6 (Dy + Pi De — D)|Myil’. (143.2) 


The inclusive cross-section can depend only on three kinematic invariants, 
which may be determined by measurements on electrons only. The three invariants 
are 


t=q=(p.e—py’, sS=(pet+Dn), (143.3) 


and pj”. The need to include the third invariant arises because, in contrast to the 
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case of elastic scattering, pj’, the ‘‘mass’” of the final hadron state, is now 
unspecified. Instead of it, however, another invariant is more convenient, namely 


v= GPh- (143.4) 
The relation between v and pj} follows from p},= prt a: 
pi? = M?+t+2bv. (143.5) 


If the initial hadron is stable (for instance, a proton), the rest energy of the final 
state exceeds M, i.e. pj? = M7’, and from (143.5), since t <0, we have 


y= Hel, (143.6) 


the equality occurring for elastic scattering. 

The kinematic invariants can be expressed in terms of the electron energies e, 
and e, in the initial and final states and the scattering angle @. We shall assume the 
electron to be ultra-relativistic (e. >m,e.>m), and neglect its mass. Then, in the 
rest frame of the initial hadron (the laboratory system), 


t= —4¢,e,sin?30, v=M(e.—6)), s—-M’=2Me. (143.7) 


Substituting (143.1) in (143.2) and summing as usual over the electron polariza- 
tions, we obtain the scattering cross-section for unpolarized electrons, which we 
write as 


do = ae aay a WW”, (143.8) 
or 
om ay dnt” Greed 
where 
Wuv = 4DepPev — (Dew + PeoQu) + Suv» (143.10) 
wes 2 (27)'6 (Dh Pn — QI RIF. (143.11) 


The tensor W”” of course depends essentially on the properties of the hadron 
currents, and in general we can only pose the problem of its phenomenological 
structure, similarly to the problem of hadron form factors. We first use the fact that 
the tensor structure of W*” must be determined only by the 4-vectors relating to 
the lower vertex of the diagram (139.2), 1.e. p, and g. From these (and the metric 
tensor g,,) five independent tensors can be constructed. The requirement of 
invariance under time reversal means that the tensor must be symmetrical, and 
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there are four such. Lastly, the condition of current conservation, 1.e. 
W*"q, = 9, wd, = 9, 


reduces the number of independent tensors to two. These may be taken as 
vie) = AER Bay Tht = (Dru — PGult)(Pro — Gul), (143.12) 


and W,,, be written as 
W,» = 4aMW, 7? + (40/M) Wo. (143.13) 


Substituting (143.10) and (143.13) in (143.8), we put the cross-section in the form 


do = (W>+2W, tan’ 30) de} doa, (143.14) 
where 
_ a? cos’36 , , 
OCa = gerrealg °° 


is the cross-section for scattering of an ultra-relativistic electron in a Coulomb 
field; cf. (80.7). 

We see that the cross-section is determined by two structure functions which 
depend on the two invariants t and v. If the physics of hadrons at high energies 
does not contain characteristic quantities having the dimensions of mass (the 
hypothesis of scale invariance), we may expect that the structure functions 
depend, at high energies, on the only dimensionless parameter, t/v. Then the 
functions W,, W, must be functions of one variable: 


Wi=(M/v)Fi(tly), = W2=(M/v) Fl t/v); (143.15) 


the ratio M/v is independent of M. 


§ 144. Hadron formation from an electron-positron pair 


Let us now consider the transformation of an electron—positron pair into 
hadrons. We denote the 4-momenta of the electron and positron by p- and p., and 
the (total) 4-momentum of the hadrons formed by pp, with p_ + p+ = p,. The process 
is represented by the diagram 


(144.1) 
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The lower vertex here corresponds to the transition current from the vacuum to a 
hadron state |n), which we denote by (n|J|0) as in §104. 
The diagram (144.1) corresponds to the scattering amplitude 


M, = —(4mralq?)it(—p)y,4(p_)n(n|J"|0). (144.2) 
We shall consider the total cross-section o, for annihilation into hadrons, 1.e. sum 
over all final states |n). Then, in accordance with (64.18), 


l 
On = 3p 2 [Mal 2ary'3CPn — 4), (144.3) 


where q = p_+p:+. The mass of the electron will henceforward be neglected; then 


q? = 2(p_p,), 1 = 14’. 
As in 8143, we write the cross-section in the form 


CO, = (47)w'W,,/20, (144.4) 
where 
w'” = a(ptq’ + p2q* — 2p*p* —3q°g""), (144.5) 
Wav = & > (277/58 (pn — GOS, [nXn|J,,|0) (144.6) 
and t= q’>0. 


Note that t is the only kinematic invariant in this problem, with the three-ended 
diagram (144.1), and q is the only 4-vector on which W,, can depend. Hence, by 
the requirement of current conservation, the tensor W,,, may be written 


W,, = 4pn(t) (ees tu) (144.7) 


where p,(t) 1s the only invariant function, depending on the properties of the 
hadron current and determining the annihilation cross-section. Substitution of 
(144.5)-(144.7) in (144.4) gives 


on = (417*a/t”)pp(t). (144.8) 

Note that the function p,(t) = —3Wt is exactly the same as p(t) defined in 

(104.9) if the currents in the latter equation are taken to be hadron currents. 
Moreover, p(t) is the spectral density of the self-energy function [I(t): 


im I(t) = — 7(t). 


In the lowest approximation with respect to a, which is being considered here, the 
function II is the same as the polarization operator Y. In this approximation, 
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therefore, p,(t) is also the spectral density of the hadron contribution to the 
polarization operator: 


im Y;,(t) = —7p,(t). (144.9) 


Using the dispersion relation (111.13) and expressing p, in terms of o, by (144.8), 
we get 


foe) 


oe ele o;,(t’) dt’ 
P(t) = zo, | BES + 10° (144.10) 
0 


which expresses the hadron contribution to the polarization of the vacuum in terms 
of the measured cross-section for annihilation into hadrons. 

Note that we could, in exactly the same way, solve the problem of electron— 
positron pair annihilation to form a muon pair (in the first approximation with 
respect to a, only one such pair is formed). Corresponding to the formula (144.8), 
the result is 


o,, = (47 a/t”)p,(t), (144.11) 
where p,(t) is the spectral density of the muon polarization of the vacuum. It 
differs from the electron polarization only in that the electron mass m is replaced 
by the muon mass yp, and, from (113.8), it is 


pu(t) = (a/3a)(t + 2n?)V (1 — 4p7/t). 


Substitution in (144.11) brings us back to the result already derived in §81, Problem 
8. 
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